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ABSTRACT 

An on-line adaptive technique is developed to provide a self- 
contained redundant -sensor navigation system with a capability to 
utilize its full potentiality m reliability and performance The gyro 
navigation system is modeled as a Gauss -Markov process, with degra- 
dation modes defined as changes m characteristics specified by para- 
meters associated with the model The adaptive system is formulated 
as a multistage stochastic process: a detection system, an identification 
system and a compensation system It is- shown that the sufficient statis- 
tics for the partially observable process in the detection and identification 
system is the posterior measure of the state of degradation, conditioned 
on the measurement history 

A suboptimal detection system in the class of linear systems of 
Wald's sequential analysis is developed The detection system is formu- 
lated as a combined control and decision problem by use of the concept 
of information value for detection and feedback of the uncertainty infor- 
mation of degradation indicated by the posterior measure The suboptimal 
control law is shown to be determined by a constant threshold number If 
the posterior measure is less than the threshold, the control corrects to 
the threshold Otherwise no control is used. The developed system is 
very simple for on-line implementation The system shows remarkably 
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close performance, expressed as mean time delay in detection under 
the constraint of a specified mean time between two false alarms, to 
that of the non-linear optimal detection system Moreover, the system 
can detect the degradation with simultaneous identification of its un- 
known polarity, which is shown to be an important piece of information 
for efficient isolation of the degraded instruments 

It is shown that the detection process can be effectively constructed 
on the basis of a "design" value, specified by mission requirements, of 
the unknown parameter in the real system, and of a degradation mode in 
the form of a constant bias jump. An invariant transformation is derived 
to eliminate the effect of nuisance parameters in the identification system 
It is shown that the ambiguous multi-class identification process can be 
transformed into a set of pairwise disjoint simple hypothesis tests A 
technique of decoupled bias estimator is applied in the compensation sys- 
tem such that the adaptive system can be operated without any complicated 
reorganization 
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CHAPTER 1 


Introduction 


1 1 Background 

Adaptation, as the ability of self-adjustment in accordance with chang- 
ing conditions of environment or structure, is a fundamental attribute of 
living organisms With the advance of technology considerable interest has 
been shown in the design of machines with the desirable attribute of adap- 
tivity The concept of adaptive systems has found applicability in diverse 
areas of communication, control, reliability and pattern recognition The 
historical survey of the development of adaptive systems can be found in 
extensive references and will not be undertaken here Instead, a brief 
review will be presented from the point of view of methods of achieving 
adaptivity which have been discussed in the literature 

A simple feedback system with a high gain m the forward path may 
operate quite adequately in a changing environment However, such a sys- 
tem may be more properly called insensitive or invariant rather than a- 
daptive 

Another approach to design the system with parameter adjustments 
programmed against measured operating conditions For those systems 
a rather comprehensive knowledge of the dynamic characteristics of the 
controlled process as functions of its operating environment is required 
This information is often difficult and expensive to obtain The programmed 
compensation is open-loop with respect to final system performance 

More effective adaptive systems employ some measure of the system 
performance as a basis for adaptation The systems are provided with 
the capability of self -monitoring their performance, and the self-adjustment 
is closed-loop with respect to performance The adaptive systems in this 
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class may be designed either by parameter adjustments to achieve some 
specified characterization of its dynamic properties (e. g a specified 
damping ratio of the system), or by direct computer control to achieve 
an optimal performance criterion. The adaptive action usually consists 
of three phases 

(1) Measurements to indicate present performance or deviations 
of performance from a desired or an optimal reference. 

(2) Interpretation of the performance measurements to generate 
an indication of necessary corrective action 

(3) Adjustment of system parameters to drive the actual system 
performance toward the desired performance 

So far the concept of an adaptive system has been discussed in 

rather vague terms, and there is no definition at present of an adaptive 

system which meets with general acceptance. Tsypkim ' suggested 

the following definition, which is a reasonable point of view with regard 

to adaptation for engineering systems. 

A process will be called adaptive when the parameters and 
structure (and, if necessary, the control signals) of a system 
are adjusted so as to use an accumulation of incoming infor- 
mation in such a way as to achieve some specified goal. This 
goal is usually some kind of optimization of a system that is 
initially poorly defined. 

Adaptive system design is concerned with system models where a 
great deal of uncertainty is involved. Thus the system must be treated 
in such a way that one can learn about the nature of the unknown elements 
as the process evolves. The multistage aspects of the process must be 
made to compensate for the initial lack of information Secondly, the ob- 
servable data will be of certain information patterns with imbedded fea- 
tures The problems involved in choosing useful information and in ex- 
pressing this information in reasonable analytic form are usually of great 
subtlety and difficulty Thus the study and processing of information flow 
in the adaptive system is of primary importance Finally, the adaptive 
system must employ some criterion, usually some measure of optimality, 
as a basis for adaptation 
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The analysis of features of adaptive systems points out that the tech- 
niques of modern decision, estimation, and optimization theory can be 
used to effectively design the adaptive systems The theory of dynamic 
programming, sequential analysis, conditional Markov processes and 
pattern recognition can provide systematic techniques for formulating 
problems and effective methods of solution of adaptive system design. As 
a basis for further discussion, some significant results obtained by inves- 
tigators in these areas will be briefly described 

Formulation of the optimization problem as a multistage decision pro- 
cess was accomplished by Bellman He applied the well-known imbed- 
ding technique of dynamic programming to derive the recursion formulas 
which must be satisfied by the optimal decision rule The property basic 
to applicability of dynamic programming is that decisions can be calculated 

sequentially by achieving the decomposition of the original problem into 

( 54 ) 

subproblems. Mitten has proved that the monotonicity and separability 
conditions of the criterion functions are sufficient conditions for decom- 
position 

In his development, Bellman assumes that the system state is a Markov 
process and that the state can be measured without error As a result of 
these assumptions the optimal decision becomes a function of the known 
state The concept of control (decision) of the system with partially obser- 

/cm 

vable data using the Bayesian approach has been discussed by Shiryaev ' 

The problem of designing an optimum closed -loop dual-control system 

/ 9 n \ 

is formulated by Felcibaum The controls are necessary not only to 

control the system to the required goal but also to investigate the character- 
istics of incomplete information about the system The concept of combined 
optimization is fairly simple, but the required analysis is rather involved 
However, in some cases^ the combined optimization problem can be decom- 
posed into separable stages A well-known result is the so-called separation 
theorem for linear systems with quadratic criterion and Gaussian input such 
that the problems of estimation and control may be solved independently 
In many cases suboptimal algorithms must be used A commonly used sub- 
optimal scheme is the open -loop -optimal feedback control discussed by 
Dreyfus 
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Sequential analysis is a method of statistical inference whose character- 
istic feature is that the decision to terminate the experiment depends, at 
each stage, on the results of observations previously made (Blackwell and 
Girshick^). The sequential probability ratio test, devised by Wald^ 80 ^ 
is a particular method of sequential analysis For testing a simple hypothesis 
against an alternative the procedure possesses the optimum property of 

minimizing the expected sample size for specified error probabilities 

( 83 ) 

Wald and Wolfowitz' proved the optimum character of the sequential 
probability ratio test based on a Bayesian formulation Some general re- 
sults of sequential analysis are given by Whittle 

( 22 ) 

Dvoretzky et al , extended the discrete sequential theory of tes- 
ting hypotheses to those problems about continuous stochastic processes 

( 14 ) 

Darling and Siegert applied the solution of first passage times for homo- 
geneous diffusion processes to problems in sequential analysis Aside from 
the difficulty of justifying certain limiting operations, the continuous procedure 
often leads to simpler analysis of the actual discrete process 

While the traditional problems of sequential analysis usually assume 
that the unknown parameter does not change with time, the progress made 
in the general theory of stochastic control processes with partial observa- 
tions can deal with the case when this parameter is itself a random process 
In this case the problem in sequential analysis can be considered as one in the 
theory of stochastic control with the simplest control --choice of the moment 
at which to stop observations, the problem of determination of optimal stopping 
rules. 

The theory of sufficient statistics for data reduction with the preserva- 
tion of the information is of particular importance for the general theory of 
adaptive processes A special interest in this aspect is concentrated on the 
case when the system can be modeled as the Markov process In study of 
Markov processes, if only some function of the state of a Markov process is 
directly measured, the fundamental entity m system optimization problems 
is the posterior conditional probability for the state given the observed mea- 
surement history. The study of these posterior probabilities and their evo- 
lution in time is the subject of the theory of conditional Markov process 
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There is extensive literature in the theory of Markov processes, for 

( 18 ) ( 73 ) ( 72 ) 

example, Doob, Stratonovich and Skorokhod It was shown 

by Kolmogorov that the probability density of a continuous Markov system 

obeys a partial differential diffusion equation. This result was formally 

( 42 ) 

extended by Kushner to systems with continuous noisy measurements 

Some results of stochastic differential equations of nonlinear filtering for 

( 88 ) ( 70 ) 

Markov jump processes are given by Wonham and Shiryaev An 

important problem arises in connection with the simulation of Ito stochastic 
processes The problem of the convergence of ordinary integrals to the 
Ito stochastic integral has been treated by Wong and Zakai. 

For discrete systems the theory of stochastic optimal control based 
on the principles of dynamic programming is sufficiently established In 
generalizing the theory to the continuous time system it is difficult to 
prove the existence of the optimal control Kushner and Fleming 
considered continuous optimization problems for the case of completely 
observable controlled diffusion processes, and presented results on the 
existence, uniqueness and sufficient conditions of the optimal stochastic 
controls. A theory of e -optimal controls for the intermediate system be- 
tween cases of discrete system and continuous system has been discussed 
( 73 ) 

by Stratonovich 

The results m pattern classification have found useful .application in 
control system design where a great deal of uncertainty is involved Ho 
et al and Sklansky^"^ present a survey and analysis of this field for 
applications in automatic control There are essentially two fundamental 
problems in pattern classification, the characterization and the abstraction 
problem The characterization problem can be simply but vaguely stated 
as finding a transformation from the original pattern to a set of features 
such that the feature states adequately characterize the original problem 
for purposes of classification but with much smaller dimension The ab- 
straction problem is the determination of a decision function of these fea- 
tures based on the data of given patterns such that new samples of unknown 
patterns can be reliably classified 

Based on the level of uncertainty of information four types of available 
data can be specified 
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(1) Functional form of the conditional density of features for the 
classes is given to within the specification of a set of unknown 
parameters 

(2) The values of the parameters are also known. 

(3) A set of training sample patterns of known classification is 
given. 

(4) Sample patterns of unknown classification are given; the de- 
cision rule has to be determined only on the basis of observed 
but unclassified patterns. 

The procedure to process the given data for calculating the decision 
function of classification is known as training, adaptation, or learning 
Depending on the types of available data, classification of algorithms 
can be broadly divided into two groups, probabilistically based and non- 
probabilistically based The main learning tool for the first group is the 
recursive application of Bayes 1 rule, while that of the latter is the itera- 
tive solution of an optimization criterion 

An important and interesting application of adaptive systems is to im- 
prove reliability of complex systems. One design philosophy to improve 

reliability is to provide the systems with the capability for self-repair. 

(75) 

Based on the principle of ultra -stability proposed by Ashby, Tsien 
discussed the possibility of building into the system a certain adaptability 
so that incidental and unexpected malfunction will automatically be corrected 
for by the control system itself without external human aid Whitaker and 
Kezer^ 85 ^ discussed the use of the model reference adaptive system com- 
bined with redundant control channels to improve the reliability of an air- 
craft flight control system 
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1. 2 Description of the Problem 

The primary goal of this research is to develop a technique of adaptive 
system design to improve reliability by detecting, identifying and compensa- 
ting performance degradation of a system modeled as a Gauss -Markov pro- 
cess. In particular the approach attempts to illustrate the application of 
modern system optimization theory to adaptive system design The designed 
adaptive system is applied to a self-contained redundant- sens or navigation sys- 
tem intended for use in long duration planetary space flight missions 

1. 3 Summary of Contents 

Chapter 2 treats the modeling of the problem The configuration of 
the redundant-sensor attitude system is described, and the system mission 
is defined In a long term free-fall space mission, the gyro bias drift 
rate is considered to be the most important component causing system per- 
formance error The drift rate is modeled as a Gauss-Markov process, 
and the operating modes are characterized by parameters associated with 
the model The forms of degraded modes are defined by the dominant na- 
ture of the sensor degradation characteristic A set of parity equations is 
generated by a linear combination of gyro outputs to eliminate the vehicle 
rate The residuals of parity equation outputs are the inputs to be processed 
by the adaptive system Each residual is transformed into a scalar state 
by application of the concept of linear aggregation of states The degraded 
gyro will be isolated 'by monitoring the parity equation output residuals 
The basic approach of the adaptive system design is then discussed. The 
adaptive system is formulated as a multistage stochastic process consisting 
of detection, identification and compensation systems 

Chapter 3 presents the main results of stochastic equations of Ito form 
for the posterior probabilities of the partially observable process when the 
unobservable process is Markovian with a finite number of states The cor- 
related measurement processes are first transformed without loss of in- 
formation into corresponding asymptotically independent processes by appli- 
cation of the innovation principle and the Kalman filtering technique The 
stochastic differential equations for the posterior probabilities are then de- 
rived by application of results in the theory of conditional Markov processes 
It is shown that the stochastic equation in the identification system can be 
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considered as a degenerate case of the detection system By application 
of a comparison theorem for diffusion processes it is shown that the real 
system can be detected by choosing a "design" value on the basis of which 
the detection process is constructed By properly defining the instant of 
degradation time it is further shown that only one detection system designed 
for a degradation mode of constant bias jump is sufficient for degradation 
modes characterized by the presence of any systematic mean value 

Chapter 4 deals with the detection system design A method is first 
reviewed in which the optimal detection problem is formulated as the solu- 
tion of a Bayesian problem. It is shown that the forementioned posterior 
probability is the sufficient statistic for optimal detection in the class of 
Bayesian problems with additive risks The optimal detection problem con- 
sists of observing the evolution of the posterior probability and is reduced 
to the determination of the optimal boundary for the decision to stop or con- 
tinue the observation at a minimum risk The problem of on-line imple- 
mentation of the Ito stochastic equation of the posterior probability is dis- 
cussed. To simplify the on-line implementation of the detection system, 
a class of linear detection systems based on Wald's sequential probability 
ratio test is studied It is shown that the detection performance of this 
system will however suffer an extra time delay in detection in comparison 
with the optimal system. 

A suboptimal detection system is developed. The feature of the system 
is utilization of feedback of uncertainty information indicated by the posterior 
probability The solution of the suboptimal detection system is formulated 
as a combined optimization problem in the class of the forementioned linear 
detection systems In the formulation of the optimal stochastic control 
problem the posterior probability function is defined to be the state vari- 
able and the uncertainty feedback defined as the control variable. A sub- 
optimal control scheme is designed to avoid the complicated computation 
for obtaining the solution to the combined optimization problem formulation 
It is found that the suboptimal control law is determined by a constant thres- 
hold. In the discrete time problem if the state variable is larger than the 
threshold, no feedback control is applied, if it is smaller than the threshold, 
the control corrects to the threshold. The difficulty in deriving the subopti- 
mal control for the continuous time case is discussed. It is shown that the 
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suboptimal system can be easily modified to detect degradation with simul- 
taneous identification of unknown polarity of mean bias, which is a useful 
piece of information for the identification stage and for efficient isolation 
of the degraded gyro. 

The performance of the detection system is studied It is shown that 
for a sequential detection system with only a single -side boundary for de- 
gradation detection the meaningful performance criteria are the mean delay 
time in detection and the false alarm error probability The miss alarm 
error probability is not well-defined since there is no explicit decision of 
normal mode The detection performance of the optimal detection system 
and the developed suboptimal detection are evaluated and compared in de- 
tail. The performance is derived by application of the theory of first 
passage times for diffusion processes It is shown that under the constraint 
of a specified mean time between two false alarms the suboptimal system 
performance expressed in mean time delay in detection is remarkably close 
to that of the optimal system. 

Simulation results are presented They consist of detection of degra- 
dation with identification of unknown polarity, a comparison of detection 
performance of the optimal, the suboptimal, and a linear system, and the 
isolation of the degraded instrument 

Chapter 5 discusses the design of the identification system — Verification 
of the normal mode and classification of degradation modes are the decisions 
made m the identification stage It is shown that the determination of the 
likelihood function in the classification process requires not only the infor- 
mation of the functional form of the conditional probability density of the 
degradation modes but also the unknown values of the associated parameters 
An invariant transformation technique is designed to eliminate the effect of 
unknown parameters An application of the technique to the measurement 
residuals will then transform the original process with unknown constant 
and ramp mean rate into the corresponding process with a zero mean 
and with a mean of attenuated ramp rate. It is shown that the ambiguous 
three -classes identification process can now be transformed into a set of 
pairwise disjoint two -class identification processes Simulation results 
of classification of degradation modes of constant and ramp bias are pre- 
sented 
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Chapter 6 considers the design of the compensation system. The 
compensation system is a linear estimator of the unknown parameter 
of the identified mode with knowledge of the correct model A techni- 
que of decoupled estimator is applied to estimate the unknown bias In 
this technique the bias estimate can be computed in terms of the resid- 
uals in the bias -free estimator During most mission periods the system 
will be operated in the normal mode, and a recursive filter is designed 
for the bias -free condition to generate the input information for detection 
and identification Once a degradation has been detected and identified, 
the filter must be modified to estimate the associated bias value With 
application of the decoupled estimator, the adaptive system can be oper- 
ated without any complicated reorganization The problem of convergence 
rate of bias estimation is discussed A slow convergence rate of the bias 
estimate is predicted analytically and is confirmed by simulation results 
The long time interval required for processing information in the estima- 
tion process in comparison with that required in the decision process 
illustrates the justification of the multistage formulation of the thesis. 

Results are summarized, discussed and related to recommended 
future studies in Chapter 7 Three appendices are presented. 
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CHAPTER 2 


Model of the Problem 


2 1 Description of the Redundant Sensor System 

Guidance and navigation reliability requirements have led to increas- 
ing emphasis on redundancy in system design. Redundancy can be provid- 
ed at the system, subsystem, and component levels, and the mechanization 
can be implemented either on an inertial platform or a strapdown system. 

In the case of a strapdown mechanization, instrument level redundancy be- 
comes appealing since the attitude reference is accomplished within a com- 
puter Degradation checks may be performed on the redundant sensor data 
before it is used in the guidance and navigation loops. Such a redundant 
strapdown system would be compact, lightweight, and theoretically highly 
reliable 

- ' (32) 

A redundant strapdown inertial reference unit (SIRU) ' was devel- 
oped at the M I. T. Draper Laboratory. It consists of a redundant instru- 
ment package, an electronics assembly, and a digital computation assembly 
The basis of the redundant concept is formulated m the redundant instrument 
package It employs six single -degree'-of-freedom gyroscopes and six lin- 
ear accelerometers, all are operated m a pulse torque to balance mode. 

The instruments are arrayed in a skewed configuration in which their input 
axes correspond to normals to the faces of a regular dodecahedron, as shown 
in Fig. 2. 1. The configuration allows the navigation system to be capable 
of operation under the condition of any three unfailed instruments, and ex- 
hibits a unique symmetry in which all instrument input axes are at a spheri- 
cal angle 2<f> (<£ ~ 31° 43’ 2. 9") from each other. This feature simplifies deg- 
radation detection and isolation, and minimizes geometric-error amplifica- 
tion by the skewed configuration in the sense that the system error ellipsoid 
exhibits a spherical symmetry when averaged over all possible failure cases 
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for every given number of instrument failures. In this thesis, the adaptive 
system is designed for the gyro package of the redundant SIRU system. 

A simplified reliability and performance analysis of the SIRU system 
is to be discussed. In general the reliability and performance of the re- 
dundant system are related through the adaptive operation of the failure 
detection, identification, and compensation systems In the following par- 
agraphs it is assumed that the adaptive system operation is perfect and the 
gyros are the only sources of failure. 

Let P be the probability of a given instrument failing during a speci- 
fied time interval when a constant failure rate, q (or equivalently a con- 
stant mean time between failures, T ), and a time interval t are assumed. 

XXI 

The probability of a given instrument failing is 


T 



The probability of any m out of n instruments failing m the specified time 
interval is given by 


P(m, n) 


n! p m 
ml (n-m)i 


(1 - P) 


n~m 


The comparison of the reliability of the dodecahedron configuration with 
the reliability of various redundant configurations is illustrated m Fig. 

2. 2. The comparison is based on a self-contained navigation system 
with available external information for isolation. A marked reliability 
advantage for the SIRU configuration is clearly illustrated. Asa further 
comment with respect to reliability it is important to note that redundancy 
itself is not a cure-all. An instrument with a mean lifetime of one-million 
hours does not preclude its possible failure. Thus a system with an on- 
line self-repair capability is of high interest. 


Performance is not generally a factor in establishing a redundancy con- 
figuration However the nature of the skewed redundancy alternate is such 
that performance must be included This need arises from "geometrical 
amplification" of gyro error that occurs with skewed gyro emplacement, 
the extent of this amplification is directly related to the particular emplace- 
ment orientation 
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PROBABILITY 



Fig, 2. 2 Mission Success Probability 




The performance of the SIKU configuration for various failure condi- 
tions are represented by comparing a SIRU reference -triad error covar- 
iance to that of the triad system in the orthogonal configuration In both 
cases, the errors of all instruments are assumed to be statistically inde- 
pendent, with zero mean and unit variance under unfailed condition. The 
triad error in the estimate of the sensed rate is obtained by a least square 
solution of the sensed rate from the subset of unfailed instruments of the 
SIRU system Table 2.1 presents the standard deviations of the errors 
and the average errors of the SIRU system: 

Table 2 . 1 

SIRU Performance vs. Failures 


No. 

Instruments 

Error 

Coordinate System Axes 

Average 


Failed 

A 

B 

C 

Error 

0 


0. 707 

0.707 

0.707 

0.707 

1 

(6 cases) 

1. 000 

0.707 

0. 707 

0. 816 

2 

(15 cases) 

1. 345 

0. 831 

0. 707 

1.000 

o(a) 

(8 cases) 

1. 345 

1.345 

0.727 

1. 581 

ib) 

(12 cases) 

3. 078 

0. 831 

0. 831 


In Table 2 1, the performance is expressed in the error coordinate 
system For a given number of failed instruments the system errors, if 
expressed in terms of the body coordinate system, will differ for each 
possible case The error coordinate system, in which errors are uncor- 
related in the error-covariance matrix through a diagonalized transforma- 
tion, can be related to the configuration of the failed and unfailed instru- 
ments to provide a physical picture of its orientation. For example, axis 
A is along the input axis of the failed gyro for the one failed gyro case 
For two gyros failed, their input axes form two acute and two obtuse angles. 
Axis A bisects the acute angles Axis B bisects the obtuse angles Axis C 
is perpendicular to both input axes A detailed analysis can be found in 
Evans and Wilcox^ 24 ^ 

Average errors in Table 2 1 are computed by adding together the covar- 
iance matrices (without diagonalization) of all possible cases for a given 
number of instruments failed and dividing by the number of matrices The 
result turns out to be equal to the identity matrix multiplied by a scalar, 
the square root of which is called the average error Thus when averaged 


24 



over all possible failure cases the errors are spherically symmetric It 
can be seen from the average errors that the performance of a set of four 
instruments in the dodecahedron configuration is roughly equivalent to that 
of three instruments in the orthogonal configuration 

2. 2 Definition of Degradation Modes 

There are a large number of error sources which result in perfor- 
mance degradation of gyro sensors. Sensors in a strapdown package • 
are subjected to a severe environment including high angular rates, 
angular vibrations and other error dynamic error sources. However, for 
inertial navigation systems which operate over long-duration mission such 
as planetary space flight, there are no significant random disturbances 
to the spacecraft except for short- interval maneuver and thrusting phases. 
Thus the gyro drift will provide the main source of error. The choice of 
a model for gyro drift depends to a great extent on the desired degree of 
accuracy. Here a mathematical model for gyro drift will be reviewed. The 
error torque M g physically present on the float is the resultant of many in- 
dependent itorque- producing errors. From the basic error physical mecha- 
nisms it can be functionally expressed of variables as specific force (g), 
float motion (r, 9, r, 0), temperature (T), and power (W): 

M e = f (g x * gy S z > x,y, z; 9^ 0 y , 9 % , x, y, z; 

0 X , e y , e z , t, w) 

All the disturbance sources can be expected to vary over only small 
ranges for a normal operating gyro, with the exception of components g^, 
gg, gQ of g along the input axis (IA), spin reference axis (SEA), and output 
axis (OA) respectively, which in some applications vary quite widely. It 

r V ' 

is therefore logical to regard g as the primary source of disturbance. 
can be expanded as a series in gj, gg, gQ about g = 0 as follows: 






, 9f 

L 9f 

. 9f 

g=0 3g I 

g=o gl 

g=o Ss 9g 0 


g=0 


So 
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The linearized model to he studied here consists of terms through first 
order- 

M e' D G +D r % +D S g S +D o8o 


The four coefficients are known, as the bias drift coefficient, and the 
acceleration sensitive drift coefficient along IA, SRA, and OA axes res- 
pectively The general coefficient (i = G, I, S, O) can be written as a 
function of disturbances. 


• D i = f x (x, y, z, Q ... T, . . . W) 


The physical mechanism of these disturbance variables may be asso- 
ciated with mechanical structure, flotation fluid, electronics and thermal 
controls 

( 49 ) 

Lorenzini m investigation of error mechanisms concluded that 
many causes of gyro drift rate variation can be classified as determi- 
nistic if a mathematically descnbable physical process which relates these 
disturbance errors to the gyro drift rate can be derived He found the rel- 
ative motion of the gyro float with respect to the case and temperature 
gradients to be major contributors to the total drift rate error observed 
He suggested that there is much room for improvement in gyro perfor- 
mance by reducing their effects through output signal compensation if those 
disturbance errors can be monitored and measured This approach for a 
more complete and accurate error model however requires a complicated 
theoretical or experimental analysis of the instrument and its associated 
equipment, and a great deal of information concerning actual degradation 
m the operational environment 

( 87 ) ( 21 ) 

Other proposed models (Wilkinson ' , Dushman v ) regard each 

drift coefficient as a stochastic process In their formulation empha- 
sis is put on the lumped behavior of the gyro output signal rather than the 
detailed investigation of the gyro output as a function of various error 
sources It is modeled as the sum of two components a stationary first- 
order Gauss -Markov process and a non -stationary random walk process 
An investigation of the physical noise processes in the gyro, such as the 
effects of Brownian movement in the damping fluid, electrical circuit noise. 
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creep in the flex leads, and bearing wear etc. suggests that the formulated 
process is a reasonable model 

From the system point of view, the output of the instrument is the 
most significant state. The instrument has degraded when and only when 
its output error exceeds the specification The approach taken in this 
thesis is that the operating modes of the gyros will be characterized by 
their output signals. The drift rate will be modeled as a stochastic pro- 
cess, yet it is recognized that a portion of the uncertainty observed in the 
output signal of a gyro stems from deterministic error sources. These 
error sources will reflect systematic values to the output signal of the 
gyro. 

The operating modes of gyros are characterized by the parameters 
associated with the stochastic process of the mathematical model The 
gyro is defined to be operating m the normal mode if the values of para- 
meters associated with the driving noise of the model are of zero mean 
and normal variance The degradation modes of gyros are characterized 
by either the presence of a systematic mean value or an increase in the 
variance of the driving noise of the model 

The types of characteristic change of the mean drift rate may be of 
two forms: a jump shift of a constant mean drift rate or a ramp of the 
mean drift rate. To define the degradation modes of the mean drift rate 
in these two simple forms permits the solution to be analytically tractable 
and the degradation to be practically compensable, and preserves the 
dominant nature of the real degradation characteristic A typical illus- 
tration of the gyro degradation mode is shown in Fig 2 3 It has been 
observed that a degradation in mean drift rate is mostly of a jump nature 

The gyro is defined to be operating in a degraded mode if the value 
of one of the parameters of the mean drift rate exceeds a "design" thresh- 
old, specified by mission performance requirements. The performance 

requirements to insure successful completion for the Apollo Mission are 

(53) 

illustrated in Fig. 2 4 ' The definition of "good" performance means 
that the gyros will contribute errors to various phases of the mission less 
than that indicated As an illustration, a change of less than 20 meru in 
bias drift rate D^, of all gyros will assure a probability of one that the 
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Fig. 2.3 Gyro Drift History 
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Fig. 2.4 Mission Gyro Performance Requirements 




error contributed by the gyros to the entry miss would be less than 10 
nautical miles. If a certain gyro indicates a change of 40 meru in D^, 
the probability to ensure "good" performance would be 0. 33. This 
probability will be reduced to zero if any gyro indicates a change of 60 
meru in D,-,. As can be seen from Fig. 2. 4, the bias drift rate is more 

U 

sensitive to mission performance than the acceleration sensitive drift 
parameters. A change in acceleration sensitive drift coefficient Dj can 
be an order of magnitude greater than that in Dq to ensure the same 
mission performance. 

Before the model of degradation modes will be defined, an analysis 
of various components of the gyro drift rate on the performance degrada- 
tion of gyros will be presented. In long term free fall space flight mis- 
sions the most important component is the bias drift rate D„{t) caused 
by bias torques independent of specific forces. Error signals due to 
acceleration sensitive coefficients D^t) (l = I, S, O) will appear only in 
short duration during thrust phases. It is a well recognized fact that 
the coefficient D^(t) is usually the most noisy component of the gyro drift 
rate and is the most sensitive indicator of performance degradation m 
a g environment. However, in the case of long duration space flight mis- 
sions their effects on mission success will be insignificant in comparison 
with the bias drift rate. This is illustrated in Fig. 2. 4 in the analysis of 
the lunar mission. 

For modeling the acceleration sensitive drift rate components, each 
coefficient D (t) should theoretically be treated as a stochastic process. 

In theory, this approach can be treated without difficulty by a simple ex- 
tension of the dimension of the drift rate process. But the approach is 
impractical due to the following considerations. First, during thrust 
phases m a short duration of minutes, the limited data just do not give 
enough information for any statistical significance. Second, unless there 
is a change m orientation and/or magnitude of the specific force during 
the thrust phase, it can be verified that the system of the extended drift 
rate process is unobservable, i. e. the acceleration- induced drift is in- 
distinguishable from a shift in bias drift. 
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Therefore it seems doubtful that one could justify the more compli- 
cated computation load resulting from extending the state vector to a 
dimension of 24 components (6 gyros, each with 4 drift coefficient com- 
ponents). In this thesis the degradation modes of acceleration sensitive 
drift rate coefficients will be modeled only as a mean bias shift. This 
is a reasonable assumption if one considers that these error sources 
will only be active for a short duration and their effects on mission per- 
formance will be serious only when there is an appreciable mean value. 

Gyros in a strap- down package will experience the full vehicle angu- 
lar rates, so a scale factor change ASF will result in a large equivalent 
drift rate change. For example, a change of 1 ppm in the nominal scale 
factor SF with a vehicle rate of 0. 1 rad/ sec. will give an equivalent drift 
rate of 1. 4 MERU. In the Apollo mission, the spacecraft is given a con- 
tinuous slow rate of 0. 3°/sec to equalize the heat exposure (Barbecue 
Mode). In this case a change ASF of 10 ppm will give an equivalent drift 
rate of 0. 6 MERU. This drift rate will degrade the performance over the 
whole mission duration. However, as long as the spacecraft rotates at a 
constant rate, this degradation mode will reflect at the gyro output signal 
as the presence of a systematic mean value of the bias drift rate DqU). 
These two error sources are inseparable, and indeed need not be separa- 
ted for compensation from the point of view of system performance. This 
degradation mode can be lumped into and treated as the corresponding 
form of the bias drift rate. Vehicle rates other than the roll rate associ- 
ated with this Barbecue Mode will appear only in short durations during 
maneuver phases. The problem of scale factor change will be treated by 
the same approach as that used for the acceleration sensitive coefficients. 
The physical mechanism of ASF change has not been studied well enough 
to define a reliable model, but some error sources have been reported. 

A SF non-linearity has been observed which depends on both input rate 
and direction. This is derived from the inherent characteristics of the 
instrument. This error source will be assumed to be corrected by on- 
line compensation based on vehicle rate information. It has been observed 
over a long-term period of months, that the scale factor for a fixed rate 
and direction show a change ASF of 100 ~ 200 ppm. Fig. 2, 5 illustrates 
a change of nominal scale factor ASF in the form of a ramp rate. 
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SCALE FACTOR DEVIATION (PPM) 




In the strap- down system, the misalignment error is practically the 
constant mechanical misalignment value It will be assumed that this 
error source has been corrected in precalibration and will not be consid- 
ered here. 

Inherent in measurements of gyro pulses are quantization errors. 

The quantization error is modeled as white noise and will be considered 
as the measurement noise Although the quantization error is uniformly 
distributed, the measurement noise is assumed to be Gaussian with zero 
mean. 

There are two different physical mechanisms for non- stationary proc- 
esses. For "normal" gyros, there is a random-walk process, and for 
"degraded" gyros a non- stationary process may result from some mac- 
roscopic change (e.g. , bearing wear) resulting m an increase of noise 
variance. The random- walk component of the gyro drift rate is proposed 
to be treated as an unknown mean of arbitrary form to be matched to one 
of the two given classes, a constant or a ramp mean bias, and will not be 
separately discussed. 

A set of real data describing the variance increase of the gyro degra- 
dation history is illustrated in Fig. 2. 6, where a computed root mean square 
standard deviation is used as a criterion of variance change. While the drift 
rate coefficient shows an increase m variance after failure over a period 
of 2000 wheel hours, only a small and gradual increase of variance has been 
observed in the bias drift rate Dq over the same period. Thus the degra- 
dation mode of variance increase will probably be of little significance on 
system performance degradation in a space flight mission m contrast with 
the characteristic of an appreciable jump in mean bias change. Moreover, 
it is to be noted that the above data refer to gyros v/ith wheels supported by 
ball bearings. There are indications and physical reasons to believe that 
there will be no appreciable wear in gas bearings. In this thesis the discus- 
sion will be restricted to the case of degradation modes characterized by 
the presence of mean drift rate but with normal variance However, some 
discussion of the degradation mode arising from variance increase and of 
the treatment of the random walk process will be presented in Chapter 7. 
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The mathematical model of the gyro drift rate process will now be de- 
fined. The gyros are operated in a ipulse-torque-to-balance mode, the sig- 
nal outputs correspond to an accumulated angle counted over some interval. 
Since the gyro error is modeled on drift rate rather than angle, the state 
variable is derived by averaging measurements over a sampling period. 

The sampling period is chosen to get a reasonable resolution of quantiza- 
tion errors. The state equations of the stochastic drift rate processes in 
continuous time are presented as follows 

(1) Gyro m normal operating mode (denoted as H ) 

The state equation is defined as 

D Q (t) = - u . D Q (t) + g (t) (2-1) 

The measurement equation is represented by 

D(t) = D Q (t) + v Q (t) 

where the notations are defined as 

g G (t) = normally distributed drift rate driving noise with known 
statistics given by E [ g G (t) 3 = 0, and 

Ejg G (t)g G (r)j = W G 6(t-r) 

v / -,(t) = measurement noise of quantization error with given 
G 

statistics E [ v G (t) ] =0, E [ v G (t)v G (T> ] = V G "S(t-r) 
u = system dynamics coefficient. 

(2) Gyro in degradation mode in the form of constant mean bias jump 
(denoted as H^). 

The state equation remains unchanged, while the measurement 
equation is represented by D(t) = D G (t) + m + v G (t) (2-2) 

The degradation state m may be caused by a jump of bias drift 
rate component, of acceleration sensitive drift rate component 
in thrusting phase, or of scale factor in presence of vehicle rate. 

(3) Gyro in degradation mode in the form of a ramp rate (denoted 
as H 2 ), 
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(2-3) 


The measurement equation can be found to be 
D(t) - D Q (t) + nt + v Q (t) 

where n is the constant ramp rate. The degradation state n for 
the ramp case may be caused by a ramp bias mean drift rate or 
a ramp change of scale factor m case of continuous maneuver 
with a constant rate. 

The corresponding state equation of the drift rate process m 
the discrete time system is expressed as 

D G (k+l) = F • D Q (k) + q Q (k) (2-4) 

The following notations are employed: 

F = 4>[(k+ 1)A, kA] = Exp {- U .A} 

A = sampling period 

The normally distributed random variable q~(k) is defined as 
(k+l)*A 

q p (k) = f $ [ (k+l)A,T ] g n (r) dT (2-5) 

G k-A U 


with known statistics given by 

E[q G (k)] = 0 

E [q G (k) • q Q U) ] = Q q • 6^ 

(k+l)A 9 

Q n = / <i> [(k+l)A, t ]W„ • dt 

G kA G 

where the statistic W~ has been defined in (2-1). 

VJT 

The measurement equation for mode H q is defined as 


D(k) = D Q (k) + r Q (k) 


( 2 - 6 ) 


where i’ G (k) is a normally distributed random variable with 
known statistics given by 


E[r Q (k)]= 


V, 


E [ r G ( k ) 


r G (^) ] = R g 


6 kg" 


k£ 


where the statistic has been defined m (2-1). 
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The measurement and state equations of the discrete time sys- 
tem of mode and Hg can be similarly defined. 

2. 3 The Parity Equations 

In order to isolate the sensor error from the desired vehicle rate 
information, a set of parity equations can be implemented by a simple 
deterministic concept using direct comparison of instrument outputs. 

Each parity equation is generated by a linear combination of gyro outputs 
such that the vehicle rate, if it is present, will be cancelled out. Two 
sets of parity equations have been studied. Since there are three indep- 
endent components of vehicle rate, four instruments are the minimum 
number to form a parity equation. By simple geometrical projection on 
the body coordinate axes, a set of fifteen parity equations can be formed 

as shown in Table 2. 2 (denoted as Set A). The other set is formed by a 

(33) 

linear combination of six gyro outputs' . By projection on each of the 
gyro input axes, a set of six parity equations can be generated as shown 
m Table 2. 3 (denoted as Set B). 

The residuals of parity equation outputs provide the only available in- 
formation reflecting gyro errors in the real system. It will be shown that 
each residual as a combination of four instrument signal outputs can be 
transformed into a scalar state by application of the concept of the linear 
aggregation of states. As an illustration, the first parity equation resid- 
ual z m Table 2, 2 will be examined under operating mode H Q . 

For the normal mode H the state equations of the gyro bias drift 
rate are represented by Eq. (2-1) 

°Gi (t) = u i D Gi (t) + g Gi (t) i = A, B, C, D 

The measurement equation is the parity equation residual. By defi- 
nition of the parity equation the residual can be simplified to represent 
only the gyro error sources: 
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Table 2, 2 

Parity Equation Set A 
(For Mechanized SIRU System) 


Instruments Equation 
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z = Parity Equation Residuals 



Table 2. 3 


NO. 


1 

2 

3 

4 

5 

6 


Parity Equation Set B 
(For Mechanized SIRU System) 

Equation 

°A ~ cos2 ^ *• ( fi B " + + 


n B - cos20 * + Q c + fi D + + Q F ) 


n c - cos2 <f> * (~^ A + a B + Q d ” + 


% - co S 2i, . <-fi a+ a B + n c + n E - n p ) 


a E - cos2$ . (n A + q b - a c + o D - n p ) 


Q - cos2c6 • (Q. + Q +0 - ft - fi_) 

F A B C D E 
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z(t) = [ c - c s s ] 


+ v(t) 


(2-7) 


where c = cos ^ 


s = sin^i 



_ D (t) 
GA 

P (t) 

GB 

D (t) 
GC 

D (t) 
GD 


0. 85065 


0. 52574 


•v(t) denotes the measurement noise after the first parity equation. 

It is observed that there is a distinction of definition of measurement 
noise of gyros before and after the parity equation. Since all gyros sense 
the same disturbance source, the measurement noises between gyros be- 
fore the parity equation are highly correlated. However, by definition of 
the parity equations, the correlated part of the measurement noises will 
be cancelled out after the parity equations. This will make detection of 
subtle degradation of gyro performance more effective due to the reduced 
background noise level. Thus the measurement noise v(t) can be defined 
as if no environment disturbance existed, and v(t) is essentially the quan- 
tization noises VQ 1 (t) 

v(t) = [ c - c s s] 


Since it can be assumed thatv^'s are independent of one another, the 
statistics of v(t) can be easily evaluated 

E[v(t)] = 0, E [ v(t) v (r)] = 2 V q 6 (t-r ) = V- 5(t-r) (2-8) 

m « 

For the construction of computationally efficient algorithms associated 

with dynamic systems of high dimension, a method is applied to aggregate 

( 2 ) 

the original system state vector mto a lower- dimensional vector (Aoki)' . 


r v (t> 

GA 

V GB (t) 

V GC (t) 

V GD (t) 
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Consider the dynamic system defined by the vector equation: 


d -2 G (t) 

= A D (t) + Bg_(t> 

dt G G 

Define a transformation of D~ (t) to a scalar x(t) 

— G 

x(t) t c £ G (t) 


The statement that x(t) satisfies the differential equation. 


- d X(t> = F x{t) + G g r 
dt 


x (°) = cd g (o) 


r -sg 

is equivalent to the condition that F and G are related to A and B by 


G = CB 
FC = CA 


( 2 - 9 ) 


This can be easily seen by a comparison with the equation 


d (C Dp), 

-i- = CA D„ + CBg~ 

dt ~ G ~ G 

Designers in general have some freedom m choosing the aggregation 
matrix C subject to some constraints imposed by the problems. For 
example, the choice of C is to be made m such a way that the error in 
modeling the original system by the system of lower dimension is min- 
imized in some sense taking into account the performance index for the 
original system. 

In the specific case considered, the choice C is seen by geometry 
to be the measurement matrix. 

C = [c - c s s] 

and 
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0 


-to 


A = 


-to 


B 


-to 


C 



-a> 


D 


( 2 - 10 ) 


where the to 's are the system dynamics coefficients for the respective 
gyro drift rate models. 

Substituting (2-10) into (2-9) produces 


F • [ c - c s s] — [ — cto^ aw-g ~ Sw q ~ 


For the state x to be a scalar, F must be a scalar. The above equa- 
tion is incompatible unless the following is true. 




- u. 


- it), 


to 


D 


- u 


and F - -w . 

But it is reasonable to assume that gyros of same type will be character- 
ized by the same system dynamics coefficient to • Thus the dynamic sys- 
tem associated with each parity equation can be reduced to a scalar system; 


x(t) = -to . x(t) + g(t) 
z(t) = x(t) + v(t) 


The statistics of the scalar random process x(t) can be evaluated easily: 

E [ g(t)]~ 0, E[g(t)g(r)] =2 W q fi(t-r) = W * 6(t-r) 

A remark of interest need be mentioned. Consider the observability 
matrix of the original system- 

Q= [C, (CA) 7 , (CA 2 ) Tj (CA 3 ) T ] 


It can be seen that Q is of rank 1 for y = to, and of rank 4 if 

to a ^to-o^id^^torA, It means that the observable system cannot be 
i-i JrS L, L) 
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aggregated. The cases for modes and can be treated m the same 
way. It is easy to show that each parity equation of the set B in Table 2. 3 
can be similarly aggregated into a scalar dynamic system. In order to 
make a comparison of merits between two sets, a sensitivity analysis will 
be made for the case of mode H^. 

For both set A and set B, the state and measurement equations are of 
the same form 


x(t) = -to • x(t) + g(t) 
z(t) = x(t) + m + v(t) 
where the state x(t) is defined as- 


(2-12) 


x(t) = cos0 • D qa - cos^6 • D gb + sin^ + sin^ 

for set A, and is defined as- 

x(t> - D ga -cos 2<t> (D ge - D gc - D gd + D ge +D gf ) 
for set B. 

With these definitions, the statistics of g(t) and v(t) can be seen to be the 
same for both sets A and B of parity equations. 

E[g(t)] = 0, E[g(t)g(r)] = W * 6(t-r) 

E[v(t)] = 0, E[v(t) v(t)] = V ’ 6(t-r) 

Suppose that gyro A has been degraded with a shift of mean bias of magni- 
tude m^. Then the parity residual z(t) will reflect an expected value: 

E[z(t)] = cos<p * m^ . for set A 

Et z(t)] = m. for set B 

Now E [ z(t) ] can be regarded as the signal used for detecting the deg- 
radation mode. Then as far as this single parity equation is concerned, 
the set B formulation will have the better sensitivity in the sense that it 
gives a larger signal to noise ratio. But this conclusion is not true if the 
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whole set of parity equations is taken into consideration. In parity equa- 
tion set B, five other equations will also give some indication of degra- 
dation, with an attenuation factor of cos 2 <fr . But m parity equation 
set A, the parity equations (no. 11-15) will give null indication of degra- 
dation. So these equations will give a perfect capability for isolating 
degraded gyro A, if all other gyros are m the normal mode. In this 
sense the parity equation’ set A will have a better isolation capability. 

In this thesis it will be assumed that there will not be the case of 
dual instruments degraded during the same interval. This will be a 
reasonable assumption if the instruments have a long mean life time 
and the instruments are operated in a system with a capability to cor- 
rect the performance degradation. To make a quantitative analysis, the 
distribution of the instrument degradation is assumed to be exponential 


P (0 g t) = 1 - e" q,t 

where 0 is the instant that degradation occurred, q is the constant fail- 
ure rate. 

It is well known that the exponential distribution has the so-called 
strong Markov property: 


P(0 g s + f| 0 > s) = 1 - 



t 


i. e. the distribution is independent of s, even if s is a random variable. 

Now suppose that an instrument has been detected m the degraded 
mode at t = s, and the system will take an additional time T to identify 
the degradation mode and compensate for the performance degradation. 
Then the liklihood of failure of two instruments is given by the probability- 


P(DF) = P(0 = s + T 


(2-13) 


A reasonable mean time between failures T of the gyroscope in- 
5 

struments is l.*7 x 10 hours. To make an estimate of the time interval 
T required for compensation, consider a simplified case. Let a random 
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process x(t) have a mean m and a correlation function defined as- 


It | 

2 T v 2 

4j (t) = a * e + m 
xx x 

The time interval T can be viewed as that required for an accurate meas- 
urement of the mean m. Consider the simple measurement scheme by 
integration. 

The measured mean M is defined as 

1 T 

M =—■ f x(t) dt 
1 0 

It can be shown that M is an unbiased estimate of mean m with a variance: 

°M =7 T {1 ""T ) (l *W T * ” m * dT 

Some simple manipulation yields 



Suppose an estimate of m with a good quality of = 0.1 is re- 

quired, the required time interval T can be found to be 

-X- = 200 

T 

x 

With a typical time constant' of gyro drift rate process r =0.5 hr . , 

X 

a time interval T of 100 hours is required. 

On substituting T into (2-13), the probability of dual failure can now 
be estimated: 

T 

T 

P(DF) = 1 - e m 

= 1 - e -(100 /170 -°00)«0.0 6 % 


45 



It can be seen that the case of dual failure can indeed be neglected. 

In this case it would not be necessary to inspect all fifteen equations 
of parity equation set A at each parity test time. A set of six out of the 
fifteen parity equations is selected to be processed for a single degraded 
instrument isolation. If a dual failure occurs, one may then resort to 
the full equation set. The parity equation set C to be actually implemented 
is shown m Table 2. 4. The choice is based on geometrical symmetry for 
a capability of unique isolation of a single instrument failure. From the 
analysis of geometrical error amplification it can be seen that the set will 
give an equal likelihood of error amplification in all three reference axes. 

A simple decision function based on the list of gyros which enter each 
parity equation will be applied to isolate the degraded gyro by monitoring 
the six parity equation output residuals. It is noted that the parity equation 
residuals m Table 2. 4 will reflect an attenuated instrument signal output 
by either a factor sin^ = 0. 52574 or cos^> = 0. 85065. A large signal-noise 
ratio will be desirable for the efficient detection of a probable instrument 
degradation. Thus for isolation of instrument A, an efficient set will be 
the equations l(ABCD), 4(ADEF), 5(BCDE) and 6{CDEF). For example, 
the parity equation 3(ABEF) will give a lower signal-noise ratio J75 

in comparison with the equation l(ABCD). But in order to giye a reliable 
isolation capability from four parity equations additional polarity informa- 
tion of the parity equation residuals must be identified. This will be dis- 
cussed m Chapter 4. 

2. 4 The Basic Approach 

The design philosophy for improvement of the navigation system relia- 
bility and performance m this thesis is to provide the system with the adapt- 
ive capability for self- repair of certain failure modes. When a part of the 
system degrades, the effect of the degradation is automatically compensated 
for by making some subtle adjustment. Implied in this activity are several 
distinct steps which must be mechanized: the existence of a degradation must 
be detected, its nature must be determined, and finally repair activity must 
be initiated. Depending on the available information in the navigation sys- 
tern the design of the adaptive system can be accomplished in different ways. 
This thesis is the design of a self-contained adaptive system without external 
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NO. 


1 

2 

3 

4 

5 

6 


Table 2. 4 

Parity Equation Set C 
(For Mechanized SIRIJ System) 

Equation 

(Q c + Oq) S = z 


Instrument 
ABCD 

ABCF 

ABEF 

ADEF 

BCDE 

CDEF 
C = cos <fr, S 


<°A - °B )C + 
-(fi + fi JC + 

(a + a )c - 

E F 

< n A - V c + 

(n c + o E >c - 
< n c - V c + 

sin <f> 


(fi + fi )S = z 
B F 


(fi + fi )S = z 
A B' 


(fi n “ fi n )S = z 

U p 


(fi + fi^js = z 
B D 


<°E - n p )S = 2 
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aiding information and without monitoring sensors of the internal states 
of the instruments. Redundant instruments are assumed which makes it 
possible to indicate instrument errors. 

The basic approach m the thesis is the application of modern system 
theory to the design of an adaptive system. The adaptive system is formu- 
lated as a multistage statistical decision and estimation process 

(1) a detection system to detect and isolate a degradation m sensors 
m the shortest possible time so as not to degrade the overall 
navigation performance by an appreciable amount. 

(2) an identification system to verify a degradation of the isolated 
sensor and to classify the degradation mode of the degraded sen- 
sor with small error probability. 

(3) a compensation system to estimate with high accuracy the unknown 
parameter associated with the identified degradation mode, and to 
recover the degraded sensor performance by compensation 

The emphasis to formulate the detection and identification systems as 
a statistical decision problem is motivated by two considerations. 

First, the formulation of the detection system as an optimal decision 
problem will show very efficient performance (expressed in time delay) 
m detecting a degradation compared with the formulation as an estimation 
problem to estimate die probable parameter change. The operating modes 
of gyros are characterized by the parameters associated with the gyro mod- 
el. The detection of the characteristic change is a problem in statistical 
decision theory. The statistical decision theory determines from which 
of several hypothesized probability distributions (with specified statistics) 
a set of samples comes, to within set error probabilities. The confidence 
level is specified by the error probabilities with respect to decision relia- 
bility with no concern about the accuracy of the true value of the parameter. 
The estimation process, on the other hand, is concerned with the accuracy 
of the estimated value. The confidence level is specified such that the es- 
timated value will approximate the true value within a small deviation. In 
comparison with the estimation process, the decision process will thus re- 
quire "coarser" information and allow greater uncertainty about the opera- 
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ting system in making a decision. The measurement time can be much 
shorter. 

Second, the degraded system may be operating in any of many pos- 
sible degradation modes. The observed input data are unclassified, bring- 
ing m additional uncertainty to complicate the estimation process. One 
will then pay for the uncertainty about the classification of the given data 
m terms of slower rate of estimation (learning). Moreover, the recur- 
sive computational procedure for estimation is more difficult than the 
counterpart with data of given classification. This problem has been dis- 
cussed in the pattern recognition field as the self-learning or "learning 
without teacher" problem (Hoet al. ). By application of statistical 

classification techniques the most probable mode will first be identified 
and then the unknown parameter will be estimated under the condition of 
an identified mode. The estimate will be more reliable and the computa- 
tion will be more efficient. 

A block diagram of the adaptive system is shown in Fig. 2.7. It is 
well-known in the identification process that there is an uncertainty rela- 
tion between measurement time and identification accuracy. To solve 
this dilemma the detection and identification systems are designed as two 
separate stages. In the detection stage the performance criterion is the 
shortest time delay m detection of a degradation under th e co nstraint of 
a tolerable false alarm error probability. After a degradation has been 
detected, the degraded instrument will then be isolated. Because of 
availability of redundant sensors, the "degraded" instrument can be 
switched off for navigation information (the instruments will always re- 
main in operation m the adaptive system). More tune is then allowed 
to make a reliable identification with small error probability. The iden- 
tification consists of making one of the following three decisions, a veri- 
fication of normal mode Hq due to a possible false detection or a possible 
re qualification of the instrument, an identification of degradation mode , 
or an identification of degradation mode The compensation system is 
an estimator of the unknown parameter of the identified mode with knowl- 
edge of the accurate model. The estimated parameter is used to recover 
the degraded sensor performance by compensation. 
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Fig. 2.7 Block Diagram of Adaptive System 






CHAPTER 3 


Stochastic Differential Equations 


3. 1 General Discussion 

The purpose of this chapter is to derive the stochastic differential 
equations for the detection and identification systems The dynamical 
equation in the detection and identification system is formulated as a 
two-dimensional partially observable ' process, in which the process 
describing the state of degradation is inaccessible to direct observation. 
The problem with partially observable information is approached by 
deriving an expression for the posterior probability of the unobservable 
component, conditioned on all a priori information and the measurement 
history up to the current time. It will be shown that the derivation of 
the posterior probability is simplified by a transformation of the corre- 
lated measurement process into a corresponding independent process 
The approach of the transformation is based on the concept of the inno- 
vation process. A simplified Kalman filtering technique is proposed to 
implement the transformation process. It will be shown that the trans- 
formation preserves the statistical information. Based on the transform- 
ed measurement process the stochastic differential equations of the poste- 
rior probability will be derived. The derivation is based on the applica- 
tion of a representation theorem due to Doob concerning the conditions 
when a process with continuous trajectories can be obtained as a solution 
of some Ito istochastic differential equation. It is shown that the deriva- 
tion of the posterior probability in the identification system can be con- 
sidered as a degenerated case in the detection system. 

' Note : The term "observable" is defined in this thesis as the meaning 
"inaccessible to direct measurements". 
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Finally, it will be shown that only one detection system, designed 
for degradation m the form of a jump of constant bias, is sufficient for 
all degradation modes characterized by a systematic mean change 
Moreover the detection system can be constructed on the basis of a 
"design" value, specified by the mission performance, of the unknown 
parameter in the real system. The approach is justified by application 
of a comparison theorem for diffusion processes. 

3. 2 Problem Statement 

Let (9 (t), z (t)) be a two-dimensional partially observable Markov 
process. The observable component is the measurement process z{t) 
of the parity equation residuals. The measurement process is repre- 
sented in each operating mode as follows- 

A. Normal Mode H Q : 
z(t) = x(t) + v(t)' 

B. Degradation Mode : 

z(t) = x<t) + m + v(t) ^ ^ 

C. Degradation Mode H 2 * 
z(t) = x(t) + nt + v(t) 

where m is an unknown and deterministic constant bias parameter, and 
n is an unknown and deterministic constant ramp rate parameter. The 
scalar state x(t) is modeled as a first-order Gauss-Markov process given 
by (2-11): 

x(t) = -u »x(t) + g(t) (3-2) 

The driving noise g(t) and measurement noise v(t) are white-noise 
processes with zero mean and known correlation- 

E[g(t)g{ T )3 = W * 6(t~r) 

\ 3“ O ) 

E[ v(t)v(r) ] = V • fi(t-T) 

During the mission the system will be degraded at an unknown instant 
6 to one of the degradation modes The observable process z(t) will thus 
reflect a jump transition from Hq to (or H Q to H^) at the instant 9 in a 
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way described by the unobservable component process 0( t). 

The process 0(t) is expressed as a stationary Markov jump process 
with two states 0 (Mode Hq) and 1 (Mode or Hg), and with a unique 
transition from state 0 to state 1 In the formulation it is assumed that 
the parameter 6, the instant at which the process 0(t) makes a transition, 
is defined by an a priori exponential distribution: 

P(0 > t) = e~ qt (3-4) 

where q is the constant failure rate of the instrument 

Using these definitions the problem can now be stated as follows Let 
w (t) ~ P {0 = t 1 z(t), t = t} be the posterior probability of degradation 
up to time t and conditioned on all a priori information and the measure- 
ment history up to time t Derive a stochastic differential equation for the 
process (t) This chapter deals with the continuous time system Some 
results of the discrete time system will be mentioned in Section 3 3 The 
corresponding formulation of the posterior probability in the discrete time 
system will be found in Section 4. 3 of Chapter 4. 

3 3 A Transformation of the Measurement Process 

To simplify the derivation of the posterior probability the correlated 
measurement process is transformed into a corresponding independent 
process in this section The approach is based on the concept of the inno- 
vation process and the Kalman filtering technique is applied for implement- 
ation of the process If the filter is implemented without modeling error 
of the real dynamic system, then it is well known that the innovation pro- 
cess is a Gaussian white noise process However, the correct model, i e. , 
the actual operating mode of the real system is only known with some de- 
gree of uncertainty in the adaptive system To simplify the design of the 
adaptive system and to reduce the computation required by the filtering 
algorithm it is proposed in this section to design the filter only on a model 
of the normal mode H Q It can be shown that the original process in the 
degraded mode can be transformed into a white noise process superimposed 
with a corresponding systematic mean with its value modified by a known 
attenuation factor Moreover the transformed and the original processes 
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contain the same statistical information The transformation is developed 
for the continuous -time system. The corresponding results of the discrete - 
time system which are useful for actual implementation will also be derived 

The innovation approach is to first convert the observation process to 
a white noise, called the innovation process, and then treat the simplified 
problem based on derived white noise observations This whitening filter 
approach was used by Bode and Shannon in the solution of the Weiner filter- 

/ Q Q\ 

ing problem for stationary processes over a semi-infinite interval Kailath' 
extended the technique to handle the non- stationary continuous time process 
over a finite time interval, and give a simple derivation of the Kalman- Bucy 
recursive filtering formulas. 

Consider the continuous observation process of the system defined in 
the normal mode Hq. 

z(t) = x(t) + v(t) (3-5) 

Assume that the process x(t) obeys the finite -expected energy require- 
ment, i e 

T * 

/ E[x(t) 2 ] dt < oc 

0 

and that x(t) and v(t) are completely independent, v(t) is the white Gaussian 
noise process 

Define 

x (t) = E[x(t) j z (t), t 2 t, Hg ] (3-6) 

- the conditional mean of x(t), given observations up to the instant 
t and assuming that { z{r) = x(t)+ v(t)} 

Then the innovation process of x(t) defined by 

A 

i/ Q (t) = z(t) - x(t) 

= x(t) - x(t> + v<t) < 3 " 7) , 

= x'(t) + v(t) 
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is a white Gaussian noise process with the same statistics as the original 
measurement noise process v(t). 

The proof essentially rests on a martingale theorem of Doob, and 
(39) 

can be found in Kailath , Thus for the normal mode Hq, the original 
correlated process z(t) (3-5) can be converted into the conditionally known 
signal process z(t) = x (t) + ^(t). For a Gauss-Markov process x(t) the 
transformation can be implemented by the well-known Kalman filter 

A heuristic explanation of the result will be given First, since 
x'(t) is the protion of x(t) that cannot be predicted from past z(* ) since 
v(t), being white noise, is completely unpredictable from past z(-), the 
quantity ^(t) may be regarded as describing the "new information" brought 
by the current observation z(t), being given all the past observations z(s), 
s < t, and the old information deduced therefrom Therefore the name 
innovation process of x(t) came into being. Physically it represents the 
measurement residual process, a familiar term in estimation theory 
Second, the innovation process ^(t) in the continuous -time case will have 
the same statistics as that of the measurement noise v(t). Since the pro- 
cesses ^p(t) and v(t) have the same finite -dimensional distribution, they 
are thus statistically indistinguishable from each other, though they have 
different sample functions. However, it will be shown that this result will 
not be true in the discrete time case 

It is to be noted that the processes z(t) and ^(t) contain the same sta- 
tistical information, 1 e , they are equivalent The proof of the equiva- 
lence is obvious if the Kalman-Bucy filtering formula is assumed Since 
^(t) = z(t) - x(t) and x(t) can be calculated from z(s), s= t, therefore ^(t) 
is completely determined by z(s), s ~ t Conversely, assuming the dy- 
namic system equation of the 'state x, the Kalman-Bucy formula 

x(t) = cox(t) + K(t) [ z(t) - x (t) ] 
x(0) - 0 

shows that x(t) is determined if {^(s), s 2 t} is known, and then z(t) can 
be obtained as z(t) = x(t) + (t), since by (3-7) 
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x(t) + Z/ Q {t) - x(t) + x'(t) + v(t) 

= x(t) + v(t) = z(t) 

Therefore /^(t) and z(t) can each be obtained from the other by means of 
a casual and casually invertible linear operation, or each can be consid- 
ered to be a deterministic function of the other Therefore they are equiv- 
alent 

Now consider the continuous observation process of the system de- 
fined in the general degraded mode H: 

z(t) = x(t) + m(t) + v(t) (3-8) 

where m(t) is defined as 

m(t) = m for system in mode 

m(t) = nt for system in mode 

It can be seen that this process can be similarly treated by a simple 
extension of the innovation transformation The approach is to regard 
the unknown bias parameters m or n as random variables, the observation 
processes can be then be whitened by subtracting out the estimates of the 
state x(t). and the bias process m(t) The transformation can be imple- 
mented by the Kalman filter with an augmented bias state m(t). The ob- 
servation process z(t) (3-8) can be similarly converted into the condition- 
ally known signal processes x(t) and m(t) without loss of information. 

z(t) = x(t) + m(t) + v 1 (t) (3-9) 

where ^(t) is the innovation process In this approach the state in the 
observation process (3-9) can be viewed as the vector form of the pro- 
cess (3-5) with an augmented bias state Thus the conclusion about the 
properties of the ^(t) process (3-7) can be formally extended to apply 
to the i/^(t) process under the corresponding assumptions Then the pro- 
cess i^(t) is a white noise process with the same statistics as the v(t) - 
process. Again the t^Ct) - /"process will have a different variance from 
that of either the v(t) - process or the i/ Q (t)- process in the discrete- 
time case. 
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Now consider the formulation of the following basic decision problem 
which is of interest in this thesis. Given observations { z(t), 0 = r = t} , 
determine, so as to minimize an expected risk function, which of the follow- 
ing modes is true: 


H 0 : z(t) = x(t) + v(t) 

H : z(t) = x(t) + m(t) + v(t) 


. (3-10) 


By application of the innovation approach, the original optimal decision 
problem (3-10) can be converted into the "conditionally known" signal 
problems • 


H q : z(t) = x(t) + v 0 (t) 

H z(t) = x(t) + m(t) + v^(t) 


(3-11) 


Since the processes x(t), x(t), and m(t) are conditionally known, the 
problem (3-11) can be further reduced to 


H q . z(t) - x(t) = i/ Q (t) 

H • z(t) - x(t) - m(t) = v^t) 


(3-12) 


It is to be emphasized that this linear transformation (3-11) preserves 
the statistical information by its casual and casually invertible property, 
and, moreover, provides the optimal information for decision by its or- 
thogonal projection property 


However, the approach requiring m(t) estimation is not an appealing 
one. First, there is no information about the actual operating mode 
(mode H^ or H 2 ) for accurately modeling the process m(t) Second, the 
estimation of m(t) will unnecessarily complicate the computation since the 
system m operated in the normal mode during most of the mission A 
different approach will be considered In this approach the observation 
process of the system defined in the degraded mode H will be treated by 
modeling the bias process m(t) as a deterministic process with unknown 
constant parameters m or n. 


Consider the following configurations of filter modeling (Fig 3. 1). 
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Fig. 3.1 Configurations of Filter Modeling. 







In configuration A the filter is constructed on the model of the normal 
system Hq, while the true dynamic system is of model H. In configura- 
tion Bj the filter is correctly constructed on the model of the real system 
H 

Then it will be shown that the observation process z(t) can be asymptoti- 
cally transformed into the following form. 

H^: z(t) = x(t) + ■ m + y Q (t) (3-13) 

for z(t) defined in degraded mode Hj, and into the approximate form 

H 2 : z(t) n x(t) + a x . n . t + y Q (t) (3-14) 

for z(t) defined in degraded mode Hg. Here a^^ is a constant parameter to 
be determined. The process x(t) is the bias-free estimate of the state x(t), 
given observations up to the instant t and conditioned on H Q that { z(t) = x(t) 
+ v(t)}. The process ^(t) is a pseudo -innovation process with the same 
statistics as the process (3-7) In the following discussion equation (3-13) 
for mode will first be derived, the result will then be extended to dis- 
cuss the approximate form (3-14) for mode Hg. For convenience of pres- 
entation the notation m(t) of the generalized bias process will be used 

The expression (3-12) can be derived as follows. Let the process z(t) 
be defined as, 

z(t) = x(t) + m(t) + v(t) 

and a Kalman filter Fq is constructed on the model of the normal system 
H q as in configuration A of Fig. 3. 1. Since the Kalman filter is a linear 
operation, then by an application of the law of superposition the transfor- 
mation of the z(t) - process can be derived as a summation of two pseudo- 

» 

processes z Q (t) = x(t) + v(t) and m(t). 

The dynamic system and the filter equation for the pseudo -process 
z Q (t) can be written as (Fig 3. 2): 

x(t) = -co x(t) + g(t) 

► 

x Q (t) = -CO x Q (t) + K(t) • [z Q (t) - x Q (t)] 
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Fig. 3.2. Filter for the Pseudo Measurement Process 



Fig. 3. 3. Filter for the Measurement Process 
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(3-15) 


K(t) = P x (t)/V 

V Q (t) = ZqU) - x 0 (t) 


where 

P (t) = the variance of the error in the pseudo -estimate x n (t) at time t, 

X u 

Consider the pseudo-bias process m(t) as the input to the filter F Q , 
the output process m(t) satisfies the filter equation 

-5i(t) = -tom(t) + £(t) [ m(t) - m(t) ] (3-16) 

The process z(t) can then be expressed as: 

z(t) = x(t) + v^(t) (3-17) 

where the processes x(t) and u 1 (t) are defined as 

x(t) = x n (t) + m(t) 

(3-18) 

J/fd) = v Q (t) + (m(t) - m(t)) -* 

It is to be noted that the representation in Fig. 3. 3 illustrates the 
implementation of configuration A m Fig. 3.1. The process x(t) is the 
bias free estimate of the state x(t) defined in (3-13) and is the only con- 
ditionally known signal of the configuration A. 

Now (3-16) of the pseudo process m(t) can be rewritten as 

m(t) = - (u + K(t)) m(t) + K(t) • m(t) 'm(O) = 0 (3-19) 

A 

where m(t) = m for degraded mode H^. 

Equation (3-19) is a first-order ordinary differential equation with 
the time-variant coefficients K(t) If the time-variant effect of K(t) on 
the solution m(t) during the transient period is to be neglected, and only 
the stationary Kalman filter is implemented, then the solution m(t) can 
be shown to be 
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(3-20) 


m(t) = 



m {1 - exp [- (K^t- to) t] } 


where (P /V) is the steady state gain of the implemented Kalman 

filter Fq The asymptotic solution of m(t) can be further simplified to 

the form 


where 


lim m(t) = -V ' m 
t-> oc 



(3-21) 


The negative sign before V is employed to be consistent with the notation 

X 

in Chapter 6 Substituting (3-21) into (3-17) and (3-18) produces 


lim z(t) = lim (x(t) + (m(t) - m(t)) + y Q (t)} 
t-*-cc t -*cc 

= lim { x(t) + (1 + V „) . m + v n (t) } 


(3-22) 


in the form of (3-13) The factor aj is defined as the coefficient (1 + Y^) 

In summary the following statement has been proved 
Given observations {z(t), 0 = t = t}, it is required to determine, so as 
to minimize an expected risk function, discrimination between two modes 
defined as 


H q : z(t) = x(t) + v(t) 

H-^ • z(t) = x(t) + m + v(t) 


(3-23) 


This optimal decision problem can be transformed into the equivalent 
discrimination problem as follows 
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(3-24) 


H q . z(t) = x(t) + i/ Q (t) 

: z(t) = x(t) + y^t) 

= x(t) + (m - m(t)) + i/ Q (t) 


Moreover, the problem can be further converted asymptotically into the 
equivalent problem of discrimination between two modes 


H, 


z(t) - x(t) = i/ Q (t) 


0 , u (3-25) 

H 1 : z(t) — x(t) = (1 + V ) m + i/ Q (t) 

The equivalence of the discrimination problems (3-23) and (3-25) can 
be stated as follows. Reasoning as the proof that the processes z(t) and 
x/g(t) contain the same statistical information, it can be shown that the pro- 
cesses v^(t) m (3-24) and z(t) are equivalent, since each can be obtained 
from the other by a casual and casually invertible linear operation More- 
over, the transformation (3-25) gives asymptotically the optimal information 
for decision by an operation of the orthogonal projection The last point 
can be seen as follows It will be shown in Chapter 6 that the following 
asymptotic relations (6-30) and (6-31) can be proved. 


8(t) = x(t) + V * m 


A 

m = m 


Substituting the above relations into the formulation of the problem in 
(3-12) 

: z(t) - x(t) - m(t) = v^(t) 


produces the asymptotically equivalent problem: 

H 1 : z(t) - x(t) = (1 + V x ) ' m + y^t) 
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Since the processes v-^(t) andr^t) are statistically indistinguishable from 
each other, then the representations (3-12) and (3-25) are asymptotically 
equivalent 

Define the transformed observation process y(t) as 

y (t) = r(t) 

where r(t) is the measurement residual of bias-free estimation, or 

r(t) = z(t) - z(t) = z(t) - x(t) 

The transformed observation process y(t) can be formally represented 
m the equivalent stochastic differential equation for the system defined m 
the mode 

dy(t) = a dt + a dw(t) (3-26) 


where 

a = (1 + V x ) • m 

2 

The parameter cr in (3-26) is the statistic of the innovation process 
i/git), and has been shown to be equal to the parameter V of the measure- 
ment noise process v(t). The w(t) process is the standard Weiner process 
with unit variance rate. 

The result will be extended to the approximate form (3-14) for the ob- 
servation process z(t) defined in the mode Hg In this case the bias process 
m(t) is defined as m(t) = nt, and the solution of m(t) in (3-19) can be shown 
to be 


m(t) = K cc n f(K +ij)t - 1 
~ 2 « 

(k + 1 or 

oc 

+ exp [-(K + u ) t] } 

OC 


(3-27) 
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The asymptotic solution ofm(t) can be shown to be 


lim m(t) 
t-*- 00 



k 

oc 

+ jj 



nt - 


K 

oc 

(K + io)‘ 
oc 


n 


(3-28) 


if the constant term is neglected for large t. Thus the z(t)- process defined 
in mode Hg can be asymptotically transformed into the approximate form 
(3-14) by substituting (3-28) into (3-17) and (3-18) 


z(t) - x(t) + (1 + V x ) nt + j^(t) (3-29) 

where V x has been defined m (3-21) 

The transformed observation process y(t) for the system defined in 
tne mode Hg for the asymptotic case can then be formally represented m 
the stochastic differential equation 

dy ~ bt dt +crdw(t) (3-30) 

in the sense of the approximation in (3-28) Here, the parameter a has 
been defined in (3-26), and the parameter b is defined as 

b = (1 + V x ) * n (3-31) 

A discussion about the factor (1 + V ) is required It is noted that the 

X 

magnitude of the factor a (or b) represents the equivalent signal amplitude 
that is used for detection However, one should not conclude that the trans- 
formation will impair the detection performance by an attenuation of the 
magnitude m (or n) From the definition V x in (3-21), one will notice that 
the factor V x depends essentially on the parameter w of the dynamic sys- 
tem., or on the correlation of the system state model. The factor (1+ V ) 
is induced by the transformation which converts the correlated process z(t) 
to a white noise process such that every transformed data is now a piece 
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of new information In case the coefficient u is zero, then the factor 
(1 + V ) will be zero The signals a {or b) will be attenuated to zero 

X 

But this means that the system state process is the Brownian motion 
process In this case the constant bias term cannot be distinguished 
from the system state 

It should be emphasized that the simplification of the proposed sys- 
tem is made possible under the condition that no information about the 
estimates of parameters m and n has been obtained The parameters 
m and n are unknown constants that have not been estimated. The detec- 
tion process however can be constructed on the basis of "design" values 
of these unknown parameters specified by the mission requirements. The 
approach is justified by the application of a comparison theorem to be dis- 
cussed m Section 3. 5. 

The transformation of the measurement process m the discrete-time 
case will now be presented. The discrete-time case will be useful for 
actual implementation. 

The following notation is employed The measurement process is 
represented in one of the operating modes defined as follows- 

A • Normal Mode Hq. 
z{k) = x(k) + r 1 (k) 

B Degradation Mode H^: (3-32) 

z(k) = x(k) + m + r^(k) 

C. Degradation Mode Hg* 

z(k) = x(k) + n • k + r^(k) 

The scalar state x(k) is modeled as a first order Gauss- Markov se- 
quence given by 


x'(k + 1) = F x(k) + q 1 (k) 


(3-33) 


The driving noise q^(k) and measurement noise r^(k) are white ran- 
dom sequences with known statistics. 


66 



E[q t (k)] = 0, E[q x (k) q : (£)] * Q . 6^ 


E[r^ (k)] = 0, E[r 1 (k) r 1 (i)] = R . 6^ 


The state x(k) in the discrete-time system is similarly generated by 
the linear aggregation of the gyro drift rate states. The gyro drift rate 
state D^k) in the discrete time case has been defined by (2-4) in Chapter 
2. The relation between the strength of the noises Q, R m the discrete- 
time and of the noises W, V m the continuous time cases can be express- 
ed as 


(k + 1)A 2 

Q = / <J»‘ [ (k + 1)A, t] w dT 

kA 

V 

R = 

A 


(3-34) 


where A = sampling period 

F=<t>[(k+1)A; kA] = Exp { -to - A} 

For the discrete-time system similar results for the innovation pro- 
cess will be obtained with some modification, the innovation process is a 
white random sequence but with a different variance from that of the obser- 
vation noise. Consider the observation process of the system in the normal 
mode H q : 

z(k) = x(k) + r , (k) 

t 1 


In this case the innovation process is defined as 


v Q (k) = z(k) - x(k 


k-1) 


= x' ( k Jk- 1) + r^ (k) 

where 


x(kjk-l ) = E[x(k) 


z(l), 0 Zis k-1] 


(3-35) 


(3-36) 
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Then it can be shown that 


E[i/ 0 (k)] = 0 . 

, * 

E[j/ 0 (k) v 0 U) ] = [P'(k) + R] 6^ (3-37) 

where P' (k) = variance of the error in the estimate x 1 (k ( k - 1) 

= E[x(k) - x(k|k-l)] 2 

The above result is well-kpown in the discrete-time Kalman Filter solu- 
tion. The variance of the {x/g(k)} sequence contains an additional term, 
which effect becomes negligible m the continuous -time case. The contin- 
uous-time case can be considered to be approached by a limiting proce- 
dure in which R becomes indefinitely large while P*(k) remains finite so 
that the variance of Vg(t) and v(t) become essentially the same. 

Now consider the observation process of the system defined m the 
degraded mode. Let the discrete bias free Kalman filter based on the 
model of the normal mode Hq be denoted as Fq as shown in Fig 3 1. 
Define the measurement residual sequence r(k) as 

r(k) = z(k) - x(k|k-l) (3-38) 

where x(k| k- 1) is the bias-free estimate defined m (3-36). Then it will 
be shown that the following asymptotic relations can be established. 

A. For the real system defined in mode 

lim r(k) = lim Vg(k) + a (3-39) 

k-*-oc k-^oc 

where the factor a in the discrete time case is defined as (in notations 
analogous to that of the continuous time case) 

a = (1 + Fi • V ) • m 

X 

B. For the real system defined in mode Hg: 

lim r(k) — limv n (k) + b • k (3-40) 

k-*ce k-*oc 
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in the sense of an approximation analogous to that defined m the continuous 
time case. The factor b is defined as 

b = (1 + F * V ) * n 
x 

The derivation will be given only for the system defined m the degraded 
mode The result can be extended to the approximate form (3-40) m an 
analogous way as in the continuous time case, and will not be discussed. 

The measurement process in the discrete time case will be expressed as 

z(k) = x(k) + m(k) + (k) (3-41) 

where m(k) is defined as m for the mode 

Define a pseudo process ZqOO as in_the continuous-time case. 

z Q (k) = x(k) + r 1 (k) (3-42) 


The pseudo innovation process r Q (k) can be implemented by the discrete 
Kalman filter F Q based on the dynamic system defined m the normal mode 



Fig. 3.4. Discrete Filter of Measurement Process. 

The following relations can be established as m the continuous time 
case: 
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r(k) = z(k) - x(k k-l) 


(3-43) 


where x(k| k- 1) is the bias-free estimate. 

By application of the law of superposition, let 

x(k k-1) = Xg(k k-1) + m(k k-1) (3-44) 

then the pseudo process m(k) satisfies the filter equation 

m(k) = F . m(k-l) + K(k) (m(k) - F • m(k-l )} m(0) = 0 (3-45) 

Define z/ Q (k) - z Q (k) - x Q (k|k-l) (3-46) 

Substituting (3-41), (3-42), (3-44), and (3-46) into (3-43) yields 

r(k) = r Q (k) + m(k) - m(k|k-l) (3-47) 

where m(k) ^ m for the mode 

let m(k) A - m • V (k) (3-48) 

Again the negative sign m (3-48) is used to be consistent w itn th e nota- 
tion m Chapter 6. With the above notation, equation (3-47-) can be rewritten 
as- 

r(k) = v Q (k) + [1+ F • V (k-1)]- m (3-49) 

On substituting (3-48) into (3-45), the factor V (k) can be snown to satisfy 

V (k) = F • (1 - K (k)) V (k-1) - K(k), V (0) = 0 (3-50) 

X XX 

The coefficient V (k) can be recursively computed. The factor K(k) is the 
discrete bias-free Kalman filter gam. It is easy to solve the asymptotic 
solution of V (k) 

X 

lim V (k) = — ^ V (3-51) 

k-«r X l-F(l-K) 
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where K is the asymptotic steady-state gain of K(k). It is to be noted that 
although the same notation is used for the factor V m both discrete and 
continuous time cases, they are of different computed value Again the 
factor is of negative polarity. By taking limits on (3-49) and using 
(3-51) the final results of (3-39) can be established 

3 4 The Stochastic Differential Equation 

In this section the stochastic differential equation of the process 7r (t ) 
for the case of degradation mode with constant bias will be derived. 

The problem of the treatment of the case of degradation mode with 
ramp rate will be discussed m Section 3 5. 

With the results of Section 3. 3, the problem can be restated as follows 

Let {a,y} be a two-dimensional partially observable Markov process. 

The observation process y(t) is represented by the stochastic differential 
equation (3-2 6) 

dy(t) = a(t)dt + o dw(t) (3-52) 


The unobservable process {a(t), t= 0} is a Markov jump process of 
two states with transition probability defined as 


P[a(t+h) = a^ 


a(t) = a Q ] 



(3-53) 


where a(t) physically represents the unknown bias parameter of the trans- 
formed measurement residual process The parameter a^ is the trans- 
formed design value specified for the degraded mode, and a Q is the corre- 
sponding value defined for the normal mode The asymptotic relation 
between the bias parameter of the measurement residual process and the 
bias parameter m of the drift rate process has been shown in (3-26) to be 


a = (V x + 1) m 

It is required to derive the posterior probability defined as 
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(3-54) 


^(t) tp {6 < t y(s)* 0 Is ^ t} 

= P {a S ai y(s), Olslt} 

The last equality comes from the definition of instant of degradation 
illustrated in Fig 3 5 

r(t) 



Fig. 3.5. Definition of Degradation of Bias Change. 

It is noted that in the real system the state space of the jump process 
a(t) is continuous, and the derivation of the posterior probabilty (3-54) 
for a continuous state space has been discussed heunstically in the ref- 

/OD\ 

erence' . However, the result cannot readily be interpreted as spec- 
ifying the dynamics of a practically realizable filter for generating the 
probability from the input data y Therefore the presentation will be de- 
rived on the assumption that the state space 'Of the a(t) -process is a finite 
set In particular, for the case discussed in the thesis, a(t) will take 
only two distinct values a^ and a^ such that the state space S = {a^, a^}. 
Denote A as the event thata(w) g a^, then the following relation can be 
established. 

Pr {A} = 2 Pr ({co*a(ot>) = a.}) 

a.eAHS 1 

= Pr {co: a(w) = a^} 

In this case, the definition (3-54) will be redefined as 


72 



(3-55) 


7T(t) = P{ a = a 1 17 ( 3 ), oS sg t> 

This definition will be used throughout the thesis 

The stochastic differential equation of the process ^(t) has been stud- 
ied by Shiryaev and Wonham In this section the main result will be pres- 
ented and the derivation will be discussed To simplify the presentation 

( 88 ) 

the approach will follow Wonham v . A derivation of the more generalized 
case can be found in Shiryaev^*^ For purpose of reference, it is conve- 
nient to introduce the following terminology 

A probability space (f2, (?, P) is defined by the specification of a non- 
empty set £2 (the sample space), a < 7 -field (Borel-field) of subsets of £2 
(events) and a probability P defined in(? A cr-field of subsets of a set f2 
is a class of subsets of £2 which contains 0 and £2 and is closed (generated) 
under the operations of complementation, countable union, and countable 
intersection. The specification of the cr-field as the domain of the proba- 
bility measure is of important significance With this specification mea- 
surability is preserved under practically all ordinary processes of analy- 
sis An extremely important property is that the limit function of any con- 
vergent sequence of measurable functions is measurable, i e. Given a se- 
quence {f^(x)} of measurable functions and given lim f^(x) = f(x). The f(x) 
will be measurable. This property will be of use in derivation of the sto- 
chastic differential equation of the process T(t). A heuristic exposition of 
the measure theory can be found in Wernikoff^ 84 ^ 

The derivation of the process v(t) is formulated in a more general form. 
Let {a(t), t = 0} be a stationary Markov jump process with a finite number 
of states (distinct step levels) a^, . . a' and let the initial 'distribution of 
a(0) be Or^(0}^j = 1, . . . , k} . Denote the transition probabilities by 

P (h) f P {a(t+h) = a a(t) = a } 

1 3 1 J 1 

1 - i/.h + 0(h) 3 - 1 ' 

P (h) =• - (3-56) 

li/^h + 0(h) j $ i . 

where the v.. = 0 are constants and 
13 
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v - 
1 


K 

2 


v 


i = 1, . . . , K 


ij 


The observed process {y(t), t>0} is defined by 


dy(t) = a(t) dt + u(t) dw(t) 


y(0) = 0 


(3-57) 


where {w(t), t = 0} is a standard Wiener process with P {w(0) = 0} = 1 It 
is required to derive a system of stochastic differential equations for the 
process 7T.(t) defined as 


= P {a(t) = a^ 


y(s), 0 g sg t } N , j = 1, 


K 


The basic points of the derivation can be outlined in three steps the 

derivation of an expression for the posterior conditional probabilities (t) , 

the generation of the (t) -pro cesses as solutions of stochastic differential 

equations by a representation theorem due to Doob, and the identification 

of the derived stochastic equations of the 7r.(t) -processes with that of the 

J 

input y(t) -process 

A formula for 7r^ (t) is first evaluated The derivation proceeds by 
considering the finite difference model (to be defined in (3-58)) of equa- 
tion (3-57) and subsequently taking limits To simplify the derivation it 
is convenient to augment to the probability space of the (a(t), w(t)} pro- 
cess, a dummy step process (a(t), t > 0} defined to have the same range, 
initial distribution and transition probabilities as the a(t) process, but 
independent of {a(t)} and {w(t)} 

Let the double indexed sequence s „ = rt/n (r 


rn 


0 , 1 , 


n, n = 1, 2, , . . ) 


and define 


' rn 


= I 


rt/n 


(r-l)t/n 


a(s) ds 


’rn 


rt/ s 

/ a(s) ds 

(r-l)t/n 


Then 
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r? = y(s ) - y(s 1 ) 

rn 17 rn J r-1, n 

rt/n 

= + / cr(s) dw(s) 

rn (r-1) t/n 


(3-58) 


For each, fixed n, the random variable (n - f ) (r = 0, 1, 

rn rn 

dependent and Gaussian with zero mean and variance 


, n) are in- 


Define 


rt/n 


v = / ff“(s) ds 

rn (r-l)t/ n 


-it ^ (t) = P {a(t) = a^ | y(s rn ), r = 0, . . . , n> 

= P {a(t) = a h , r = 1, . . . , n} 


1 


rn 


The last equality comes from the fact that the same cr-field is generated 
by the random sequence {y( s rn )} or 


'rn 


Then an expression for it At) can be evaluated using the transition 

J 

probability of the Markov process and the Gaussian distribution of the 
random sequence { b rn } 


IT (t) = lim 7T 


(n) 


3 


n-*oc 


1 


(t) 


K 


= N 2 * (0) P ,<t) ' E {e 

i=l 1 ^ 

where 


e(t) 


a(0) = a^ a(t) = a ; y(s) (3-59) 
0 g s g t} 


\-2 


0(t) = f o(s) 2 a(s) dy(s) - 1/2 / ct(s) * • a(s)“ds 
0 0 


and N is the normalizing factor for probability Let 

"t "t 

^(r>t) = exp [/ ct(s) 2 a(s) dy (s) -1/2 / cr(s) 2 a(s) 2 ds ] 


(0 S r S t) 
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By using the fact that the a(t) process is Markov and is independent 
of the y(t) process, it can be shown 
K 

7T (t + h) = N £ 7T.(t) P (h) 

3 i=l 1 X 1 


E <{ t+h) a(t) = a^ a(t+h) = a , y(s), t ^ s g t+h } 


(3-60) 


It follows that the joint process {a(t), ?r(t), t = 0} is Markov 

The expression for tt ( t) is useful in applications only when t is fixed 
Since the observation data is coming in continuously, the second step 
therefore is to derive a stochastic differential equation which generates 
the posterior distribution when the input data is continually observed 
The basis of the derivation of the stochastic differential equation is an 
application of a representation theorem due to Doob(or a related formula 
of Dynkin) concerning conditions when a process with continuous trajec- 
tories can be obtained as a solution of some Ito stochastic differential 
equation. The Dynkinformula is more easy to apply if the process jr(t) 
can be represented as an explicit functional of the observed process y(t) 
However, a representation of the derived 7r(t) expression as an explicit 

functional of the input process is unavailable, the second step in derivation 

( 18 ) 

of the stochastic process jr(t) will then be to apply the Doob theorem ' 
The Doob treatment will be extended to the multi -dimensional generaliza- 
tion It is to be shown that the process w(t) can be represented in the 
form of the Ito stochastic differential equations 


d?r (t) = m (a(t), ?r(t))dt + b (t, 7r(t)) dw (t) j = 1, . . 
11 11 


K 


(3-61) 


where ^.(t) are the standard Wiener processes 

In the application of the Doob theorem, the verification of the assump- 

(70) 

tions in the statement of the theorem is discussed in the references 

The coefficients m. and b. are calculated from the derived w.Ct) expressions 
11 3 

(3-57) by 
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(3-62) 


m.(t.,7r, a) = lim E 

3 h->0 


*(t+h) - w (t) 


a(t) = a, (t) = it 


« [ ^(t+h) - IT (t)] 

b z (t, it, a) = lim E — - 

3 h-*0 h 


a(t) = a, (t) = tt\ 


(3-63) 


After some rather tedious calculations it can be shown 

K (a-a) (a -a) 

m (t, a ,tt)=-vtt + 2 v Jr. + — — J w 

3 3 3 1= i 13 i qA 1 


y a. -a 9 

b (t , a, it) = ( 7T — L 
J J 


r 


(3-64) 


K 

where a = 2 a. n 
i=l 1 1 


In order to show that { it (t), j = 1, . . . , K} can be represented in 

the form (3-61) the sample functions of the i r (t)-process must be proved 

3 

to be continuous On using the w.(t) expression (3-59) the following relation 
■v* 3 

can'be established. 


E { K (t+h) - 7T (t)| 

J J 


oF t ) § const, h 


1+c 


> 0 


where ^ is the smallest cr-field with respect to which the random processes 
{a(s) J jr(s) J 0 < s < t} are measurable 

By application of Kolmogorov's criterion on the continuity of random 
functions it follows that the process ff\(t) is continuous with probability 1 
The representation theorem of Doob will then be generalized to allow for 
the fact that only the 7f(t) components of the {a(t), ^r(t)} process are contin- 
uous This depends on the application of properties of the martingale pro- 
cess 
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Let (3^ be the smallest c-field with respect to which the random processes 
{a(s), tt(s), 0 = s § t} are measurable Then it can be shown that each pro- 
cess 


v. 

{ t = °* = a ( s ), 3T(S)) ds I gf tg 0} 

J 3 J 0 J 

(3-65) 


is a martingale process By application of a theorem of Doob to the martin- 
gale process {n t = 0} , a standard Wiener process w.(t) can be found 
J *- 3 

such that the following results will be true 


[a(t) - a(t) (a - a(t))J 

dTT.(t) = { 7T (t) 3 V 7r.(t) 

J 3 2 3 3 

o- (t) 

K 

+ S v. »(t)} dt (3-66) 

i=l 13 1 
i^3 

(a - a(t)) 

+ it (t) - -i .. _ dw (t) 3 = 1, ■ • - , K 

3 n(t) 3 


The process w^.(t) is constructed as follows: 


w (t) = J [b (s, a(s), ^(s)] -1 dff.(s) 
3 o 3 3 


(3-67) 


Finally, it remains to identify each w.(t) process with the w(t) process 
of the observation process y(t) such that the derived stochastic equations 
of the it (t)— process (3-66) can be implemented. The observation process 
is represented by 


Let 


dy(t) = a(t) dt + cr(t) dw(t) 

I 

t 

y(t) = y(t) - / a(s) as 
0 

t 

= / ff(s) dw(s) 

0 


(3-68) 
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It can be shown that the process {y(t),^., t 0 } is a martingale Furthermore 


w(t) = / cKs)" 1 dy(s) 
0 


(3-69) 


Using the relation 

lim h 1 E{[y(t+h) - y(t)J [3r(t+h) - 7r.(t)] | 3? . > 
h-^0 . 3 3 t 


= ff(t) b [t„ a(t), ^r(t)] (3-70) 

J 


it follows that 


E { [ w(t) - w(s)] [w (t) - w (s) | 3P } = t-s, 0 < s < t (3-71) 

3 J 8 


In view of the conditions that the processes w(t) and w (t) are standard 

J 

Wiener processes, it can be verified that 

E { [ w(t) - w (t)] 2 } = 0 for t > 0 
J 

Hence for each t > 0, w(t) - w.(t) with probability 1 From continuity of 
Wiener processes it follows that with probability one, these processes 
are essentially identical. Thus the following main results have been es- 
tablished: 


K a(t) (a.-a(t)) 

d7r (t) = { [~v 7T (t) + E v . 7r (t)] - 4 7T (t)}dt 

3 3 3 !=1 x 3 1 C7(tr ] 

[a -a(t) ] -n (t) 

+ — 3 — v- 2 dy(t) j = 1 , . . , K (3-72) 

' a(t r 


The following remarks are of interest - 

(1) It follows from the above equation that th_e results can be split 
into two parts The term [ - Jr.(t) + i/..7r^(t) ] corresponds 

i^3 
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/ 


to the a priori evolution of the ir (t) when the process y(t) is not 
observed or when cr(t) = oc. On the other hand if io.(t) s 0 the pro- 
cess a(t) is degenerate in the sense that there is no transition 
among states a . The process corresponds to an unknown but 
unchanged state a . The stochastic differential equation is then 
simplified to 


(a -a(t)) 

dir (t) - it (t) — 3 — 7 - (dy - a(t) dt) (3-73) 

^ 3 a(tr 


( 2 ) The observation process y{t) can be represented by the stochastic 
equation- 


dy(t) = a(t) dt + or{t) dw(t) 


(3-74) 


where 

K 

a(t) = S a 7 r (t) 
i = l 1 1 


The proof essentially rests on the martingale theorem of Doob 
Define a process y(t) such that for t e [0, T ] 

dy(t) = dy(t) - «(l) dt 

= (a(t) - a(t)) dt + <y(t) dw(t) (3-75) 


Write / t = y( T )> 0 = t § t} 

a(t) = E {a(t) I y(T ), 0 = t ^ t} 

= E { a(t) 13^} 

where 3 ^ is defined as the smallest Borel field ( cr -field) generated 
by the random variables (y(r), 01 t S t}. Then y(t) is 3^ measur- 
able The relation 
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implies that 


t 

E (y(t) 130 = y(s)+ / E [a(r ) - a(r ) i jP ] dr (3-77) 

3 * s s 

Using a smooth property of successive conditional expectation: 

E { E {f(u>) |oP_ } ISO = E { f(u) \ fr } (3-78) 

T o fa 

if 5 s = . This means that every set in 3 is also a set in^ > or 

ST ST 

equivalently every set in is the union of sets in & . Thus 3 s is 
coarser, it contains fewer sets or "less information" On using 
(3-78), (3-77) can be written as 

t 

E{y(t)|3«} = y(s) + / E{a( T )-a( T )} I X} \ &J dt (3-79) 

s s , 

=.y(s) 


This proves that {y(t),$j., t > 0} is a martingale process Rea- 
soning as in (3-66) the representation (3-74) can be established 

(3) The stochastic differential equation 7r(t) is non-linear and more- 
over there are multiplicative factors of a diffusion nature associ- 
ated with the term dy. This will complicate on-line implementa- 
tion A simpler expression can be obtained by a transformation 
of ?r(t) mto a likelihood ratio function X(t). Consider a special case 
with ] =1,2, and define * 


m = in [ 7r(t) 1 

1 — 71- (t) 

where 7r(t)^ ir 2 (t), 1 - it (t) = 7r ^(t) 


(3-80) 


Then the equation (3-72) is simplified to : 



d7T = 7T(1-7T) 


(3-80) 


^ - 1 2 { dy - [a.^ + a Q (l-?r)]dt} + q(l-7r) dt 

CT 

7T(0) = 7T 0 

where q = v 12 from (3-53) and (3-56) and v 21 = 0 

The stochastic equation of the X(t)-process can be established by an 

(23) 

application of the Dynkm formula . The main result of Dynkin's for- 
mula for the one -dimensional case can be stated as follows. Let the pro- 
cess {z(t) 3 t > 0} satisfy the stochastic differential equation 


dz(t) = m[t 3 z(t)J dt + cr[t 3 z(tj]dw(t) 


Let d = d(t. 5) be a numerical function twice continuously differentiable 
m (t, |) for t > 0 and g m R. Put i//(t) = tj> [t, z(t)] . Then the process { ^(t), 
t > 0} satisfies the stochastic differential equation: 

d^(t) = m[t, z(t)]dt + <7[t 3 z(t)] dw(t) 


w 


here functions fh and a are given by 


frWt.P) = g) . m(t 3 f) 

8 t 9§ 




a(t 3 ?) • a (t 3 ?) 

9§ 


This equation of the (//(t)-process can also be written 


d^(t) = ) 

at 


g = z(t) 


+ i£-i%LL.a 2 (U) 

9? 


dt 


I * z(t) 


, S) 

Bg 


dz(t) 


(3-81) 


g = z(t) 
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]By identifying X(t) and 7r(t) with i//(t) and z(t) in (3-81) the stochastic 
equation 

( 3- - * 3. J 

dX = L_0_ [dy _I ( a i+ a o )dt] + q(e" X +l)dt, X(0) = X Q (3-82) 

a 

can be established. The stochastic equation is non-linear, but the mul- 
tiplication factor of a diffusion nature was eliminated. 

3. 5 A Comparison Theorem: 

As mentioned m the discussion of the system model, the actual system 
is known only to be in one of the operating modes, with a given functional 
form of the conditional probability density of the modes but with unknown 
associated parameters. The difficulty of unknown parameter estimation 
is to be avoided by choosing a "design" value on the basis of which the de- 
cision process is constructed. The performance criteria, being either 
the error probabilities or time delay m detection, are specified for this 
design value. It will be shown that the real system can be detected at least 
as fast as that achieved in the case of the design value, when the magnitude 
of the actual value of the unknown parameter is larger than that of the spec- 
ified design value. 

In this thesis the time of degradation due to a ramp rate is defined as 
the instant when the ramp rate will result in an equivalent intolerable de- 
gradation in performance corresponding to the "design" value of a constant 
bias jump in drift rate. With this definition for the degradation mode of ramp 
change, it will be shown that only one detection system designed for a deg- 
radation mode of constant bias change is sufficient for both degradation 
modes of mean change. The proof of these results is based on a compari- 
son theorem for diffusion processes The statement of the theorem can be 

(72) 

expressed as follows (Skorokhod). ' 

Suppose that a^(t, x), a 2 (t, x) and cr(t, x) satisfy the following conditions: 

(1) a^(t, x), and a 2 (t, x), and a(t, x) are continuous m their variables 
for t e[t Q , T], x e( — oc, oc) 

(2) o-(t, x)> 0, and for every c > 0, there exists a >~^Tand L > 0 such 
that for |x| g c, jy! ^ c . 

| a(t, x) - o(t, y) 1 ~ L I x - y |“ 
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Suppose further that |^(t) and f are with probability 1 continuous 

solutions of the equation 

t t 

§i(t) = §i(t 0 )+J f^sjjds + / a( s, ^(s)) dw(s) 1 = 1, 2 

t 0 *0 

Under these conditions, if a^(t, x) < a 2 (t, x) for every t and x and if 
PUi<t 0 ) = 5 2^0^ = 1 tlien ^l^ = § 2 ^^ for ever y t wi ' th probability 1. 

Loosely speaking, under the above assumptions, a diffusion process 
is a monotonic function of the transition coefficient 

The comparison theorem will now be applied to prove the following 
statement 

Let the observation process be defined 

dy(t) = a(t)dt + crdw(t) ‘ (3-82) 

Suppose that the stochastic differential equation of the process v (t) 

(3 - 81) is designed on the basis of the "design” value of the parameter 
a such that, 

(a l” a o) 

Att - 7r<l-7r) g {dy - [a 1 ?r+ a 0 (l~7r)]dt} (3-83) 


+ q(l-7r)dt 

Then if the unknown parameter has an actual value a> a^ the following 
relation is true 

7r(a, t) k ^(a^t) for all t (3-84) 

The proof is a straightforward application of the comparison theorem. 
Substituting (3-31) into (3-83) produces 

(fci-a. n) , 

d?r(a,t) = 7r( 1 — 7r) — x 2 ‘{a - [a^t- aUl-Tr)] } dt 

a 

( a l -a 0> 

+ q(l-7T)dt+ 7l( 1 - 7r) dw(t) 
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( a i-a n ) 

dJrfa.pt) = 7r( 1 - tt) g * {a 1 - [a^ + a Q (l-7r)] }dt 

<7 

( a i _a o) 

+ q(l-7r)dt + tt(I-tt) ~ ^ — dw(t) 

Since the initial conditions can be assumed to be the same, the follow- 
ing relation follows- 


?r(a, t> ^ t) 


for all t 


(3-85) 


It will be shown in Chapters 4 and 5 that the optimal decision (detec- 
tion or identification) problem consists of observing the posterior proba- 
bility process n(t), and evaluating it against a specified threshold to de- 
termine the decision. Since the process ir(t) is a diffusion process with 
a continuous trajectory almost surely, then the condition (3-85) implies 
that the process jr(a,t) will cross the threshold earlier. This proves the 
result. 

On the other hand if the value of the parameter a and the decision 
value a^ are both of negative polarity, then the condition -a< -a^ implies 
that 

^(-a,t)S 7r(-a^,t) for all t (3-86) 


This means that in this case the process 7r(-a,t) is of larger magni- 
tude but with negative polarity. Since a threshold of negative polarity will 
be specified m this case, this means that the process 7r(-a, t) will also 
cross the threshold earlier 

The following statement can be proved in a similar way. If the un- 
known parameter of the real system has an actual value a < a^, then tne 
process 7r(t) designed on the basis of the "design" value a^ over a fixed 
mission duration will have less chances to cross the specified threshold 
for decision, 1 . e. , the performance of the actual system will have a 
smaller value of false alarm error probability than the computed perfor- 
mance based on the design value. 

The problem of the treatment of the degradation mode of ramp change 
will now be discussed. The instant of degradation in the ramp case can be 
defined m two ways as shown in Fig 3 6 

a. 

(1) t = 0 when a ramp rate of an intolerable magnitude appears. 
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(2) t ~ 9 , when the ramp will result in an equivalent intolerable con- 
stant bias drift rate. 

The latter definition will be adopted in this thesis. It will be shown 
that in this case only one detection system designed for the constant bias 
case is sufficient for both cases of degradation modes H-^ and H 2 - 

The measurement equations (3-31) and (3-32) can be rewritten as 

H^- dy = adt + cr dw(t) 

H 2 : dy = bt dt + a dw(t) (3-87) 

= adt + (bt-a) dt + cr dw(t) 

Substituting (3-87) into the stochastic equation ir(t), it can be easily 

■A, 

established by the comparison theorem tnat for t ^ T + 6 ' , 

ir(t, Hg) > ff(t, H^) with probability 1 (3-88) 


the notations 7r(t, Hg) and tt (t, H^) are defined for the process 7r(t) condi- 
tioned on modes H 2 and respectively The value T is determined by 
the following relation 


(bT-a) = 0 


T 


a, 

b 


(3-89) 


where the parameters a and b have been defined m (3-31) and_(3-35) 

As illustrated m Fig. 3 6, with the employed definition of the degradation 
instant, the degradation mode H 2 can be detected at least as fast as is 
achieved in the case of tne design value by the detection system based on 
the constant bias degradation mode H^. 
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CHAPTER 4 


The Detection System 
4. 1 General Discussion 

In this chapter the optimal detection problem is formulated as the 
solution of a Bayesian problem. It is shown that the posterior probabil- 
ity is the sufficient statistic for optimal detection for the class of Bay- 
esian problems with Markovian additive risk function The optimal de- 
tection problem consists of observing the posterior probability process 
and is reduced to the determination of the optimal boundary for the deci- 
sion to stop or continue the observation at a minimum risk The problem 
of on-line implementation of the Ito stochastic equation of the posterior 
probability is discussed. 

A suboptimal detection system m the class of Wald's sequential test 
process is developed The solution of the suboptimal detection system is 
formulated as a combined stochastic problem by utilization of feedback of 
uncertainty information. The suboptimal control law in the discrete time 
system has been found to be determined by a constant threshold. The 
extension of the optimal control into the continuous time system is dis- 
cussed It is shown that the suboptimal system can be simply modified 
to detect the degradation mode with simultaneous identification of polarity 
with little additional computation 

The performance of the detection system is studied. It is shown for 
a sequential detection system with only a single-side boundary for degra- 
dation detection the meaningful performance criteria are the mean delay 
time in detection and the false alarm error probability. The detection per- 
formance of the optimal detection system and the developed suboptimal 
detection are evaluated by application of the theory of first passage times 
for diffusion processes. It is shown that under the constraint of a speci- 
fied mean time between two false alarms the suboptimal system perfor- 
mance expressed in mean time delay in detection is remarkably close to 
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that of the optimal system. Simulation results are presented. 

4. 2 Decision and Sufficient Statistic. 

The detection system is characterized by the feature that information 
about the state of interest is not directly accessible in the observation 
process In Chapter 3, the posterior probability of the unobservable state 
describing degradation conditioned on the past history of observations has 
been derived, and has been shown to be Markovian The posterior prob- 
ability gives the most complete description of the information about deg- 
radation of the system. This transformation of the original system into 
the posterior probability is the feature of the treatment of the partially 
observable process. Based on this information the problem is transform- 
ed into process with complete information. The optimal detection prob- 
lem will then be formulated as a special case of the optimal control prob- 
lem in which the optimal control is the decision to continue or stop the 
observation at a minimum risk. Under the general case of Markovian 
additive problems, it will be shown that the posterior probability is the 
sufficient statistic for determining the optimal decision. 

The concept of sufficient statistics is developed in the theory of sto- 
chastic optimal control. In the stochastic optimal control problem, con- 
trol solutions are chosen from the set of admissible controls on the basis 
of the observable measurement history up to the current instant. If there 
exists a finite set of sufficient statistics, which determine the expected 
cost to complete the process, then the optimal control becomes a function 
of these sufficient statistics. In particular the statistics which are to re- 
place the measurement history must be sufficient in the following respects. 
They must be sufficient to define the expected cost to complete the process; 
they must be sufficient to determine the selection of controls from the 
admissible controls; and they must be sufficient to determine their own 
future evolution. 

It has been shown in Chapter 3 that the posterior probability w-ill sat- 
isfy the third requirement. In this section it will be shown that the poste- 
rior probability will be sufficient for the problems of the optimal decision 
in the class of Markovian additive problems. The problem of determining 
the sufficient statistic is approached by deriving an expression for the re- 
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current formula of the expected risk function, via the method of dynamic 
programming. The posterior probability will then be shown to be suffi- 
cient to define the expected risk to complete the process. 

Consider the system in the discrete-time case, with n e [0, N] tak- 
ing 0,1,. . . N < oo, and N is fixed. Let ij^n) denote a generic. symbolic 
function. The following notations will be employed. 

t[i n = vp (n), ijj n =t{^(i), 0 < i = n} , 

4* “ = s < l g n} 

* b 

Let d = {d^}, 0 = (0 n ) and y = {y n } be the random sequence generated 
by the decision, unobservable and observable processes of the investigated 
system respectively. The corresponding realizations and sample spaces 
are denoted as {6, 0 , rj} and {D, 0,r } respectively. Let o n denote the 
triple state { d n , 0 , y n ) of the system. Define the loss function W(* ) = 0 

as follows: W(N, a N ) = total loss from the initial time i = 0 to the terminal 

time i - N, using the admissible decision function { d }, n = 1, ... N, 

N n 

which leads to the value a . 

It is assumed that at any instant n the decision space depends only on 

the past history = (d n y 1 1 *) without anticipation. Furthermore, the 

past outcomes of observations X n ^ will impose certain restrictions on the 

admissible decision space from which the decision function d can be cho- 

1 n £V 

sen, i. e. the maximal admissible subspace D n = {X n ~ l ) — •& , where 
is the admissible space with no restrictions. A decision rule is defined by 
a sequence of functionals {6 n }: 

6 n = 5 n (D n , d n " 1 ,y 11 '' 1 ) 1 n = 1, ... N 

where 6^ is the decision to be made at the time step I e(n-l, n] 

For each decision rule 6 an average risk is defined 

R N (S) = Ei W(N, o N ) (4-1) 

O 

where the mathematical expectation is taken with respect to a probability 
measure P (•) generated by the given decision rule 6. A decision rule s 

O 


89 



will be defined to be optimal, if 


p N = inf R N (5) = R N (6*) (4-2) 

6 

Consider the special case with the following assumptions. 

A. The loss function is an additive function: 

W(n, a n ) = 2 W(r, or ) 
rh n 

B. The unobservable controlled process {0} is a Markov process: 


V 0 ' 


,n 


e 11-1 ) = 0 


n 


(0; d 


n' 


Vl> 


C. The observable process {y} is a sequence of conditionally inde- 

pendent random variables. For a fixed realization of the sequence 

{si and {8}, the random variable y is independent of the 
u. n~l ^ ^ 

history y : 


n n (r, d n , e n , y n x ) = n n (r , d Qj e n ) 


D. The admissible decision space D depends on the past data in the 
Markovian manner: 


D (<9 n_1 , 
n' * 


n-1, 

y ) = 


D (0 
n n- V 


y n -i> 


Under assumptions A-D, the problem of sufficient statistics for the 
partially observable system will be discussed. Later, for the sequential 
decision problem, the assumptions will be further simplified. In particu- 
lar the unobservable process is not a function of the decision d , and the 
preceding outcomes of observations impose no restrictions on the admis- 
sible decisions. 

First the basic recurrent equation which must be satisfied by the opti- 
mal risk for decision problems is to be derived. Let 

N N n 

p 1N / y rn inf E{S W(t, ct ) | X } {4-3) 

y — 1 T 
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Then (X n ) is the minimum expected cost to complete the process 
from time a, given the observation history y 11 , using some admissible de- 
cision function { d^ } i = 1, . . . , n, and the optimal decision function { d^ } 
m the interval (n + l) i 1 5 N. it follows from the "principle of optimality" 
(Bellman that 


P N (X n )= inf E{ [W(n+1, a x ) + p N (X n+1 )j 

n 


N 


, n+l, 


n+l 


n+l 


x n } 


(4-4) 


A 5 n+l 

where W(n + 1, °’ n+ ^) - = W (n, <r ) is the incremental loss generated 
by 6 n+ ^ at time step I e (n, n + ll. 

The problem of deriving sufficient statistics is reduced to the study 
of the measurability of the risk m the recurrent formula (4-4), or to the 
study of the minimum number of statistics which are sufficient to deter- 
mine the expected risk function to complete the process. 


For the last stage n = N-l, 


N 

P 


(X 


N-l 


N-l, 



E{ W(N J a N ) 


X 


N-l} 


(4-5) 


Let an admissible decision function d^ be chosen, then by assumption 
D, dj^ e D^(X^_^). The expected value of the function (4-5) can be ex- 
plicitly expressed as: 

p N (X N " 1 ) = in f {/W(N, 0, tj, d >dF (£,r,|d X N_1 )} (4-6) 

N-l d N N y N, y N N 


By application of the Bayes rule, the probability P(*) can be expressed 
as* 



N 


(jS, n I d N , X 


N-l 


) 


= / P (rjld , X N " l 3 0 ) P fl (010, X N_1 ,d ) dP (0|X N_1 d ) (4-7) 

0 y N LN N N N-l 1 


By assumption C, since y^ is independent of the past history for fixed 
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(4-8) 


d N and °N' therefore 


Py N (nld N ,x N - 1 ,/i) = p yfT (n | d N ,g ) 


N 


The probability P g (|3 | x N , d KT ) does not depend on the future decision 


V 


N-l 


N 


P fl (/3 ! X , d N ) = P 0 ($1 X™" 1 ) 


'N-l 


(4-9) 


N-l 


since the processes { d} , {0} and {y} are defined as processes without 
anticipation. Define 

7n T1 (£) = F ft I X N_1 ) (4-10) 

N-l 0 N -i 

and notice that it is sufficient to know 0^ from 0^_^ because of the Mark- 
ovian property of the process {0} by assumption B. Then Eq. (4-7) can 
be rewritten as- 

p 9 , ()3,nld x N_1 ) 

°N y N 

= / P (rjld 0)* P e Oip,d^)dx (|) 

0 y N N 

= I P v e (i7.0 I & d ) dir ($) (4-11) 

0 y N’ N 


Denote 


P(. 


|d N^N- 


) s J P(‘ 

0 


V /3) d,r N-l (/3) 


(4-12) 


Since the true state of the process 0 is unknown, its best information 
can be represented only by a set of posterior probabilities, tt is the 
set of random varia'bTes )S')» ’J3 e 0 } , where represents the 

posterior probability of an element of the given sample space 0 . The ex- 
pression P(* 1 d^, can be interpreted as the expected value of 

P( • | d^, 0) against the posterior probability ^ based on given observa- 
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tions. P(* | 7r^._^) 1S no ^ a conditional distribution in the usual sense 

of conditioning on a given value of the random variable The symbol 

just means that the distribution according to the defining expression 
(4-12). 

If the sample space 0 is discrete with a finite set of states denoted by 
{ J3..}, where i = 1, . . . , m, then the expression can be written as; 


P( -' V Vl* “ =1 P i<* 1 V * N -1 

"n-I- {? N-r ***' ,r N-l } 


(4-13) 


i A 

where the 7r .= 7r AT . (0 .) represents the posterior probability of the state 

IN i INI 1 X ^ 

ft . based on observations up to (N-l) and P^* | d^) - P(* j d^, fi ..) represents 

the probability of an event conditioned on the event that the state j3 ^ is true. 

Therefore (4-11) can. be expressed as 



y o.n i 



X N_1 ) 


S N' y n 


<fJ - " 1 d N' Vl> 


(4- 14) 


Substituting (4-14) into (4-6) produces 

P^_ 1 ( X N-1) = inf { / W(N, 3.T?, d N ) • d P 0 ^ (fi,r) | d N ) w (4-15) 

d N N* N 

Therefore the conditional expected risk E { W(N, a^) | X^ d } has been 
shown to be defined by the posterior probability ‘ 

Thus the choice of an optimal decision s M at interval I e(N-l, N] is de- 

N-l ^ 

fined, not by the past history X , but only by the posterior probability 

, j and X AT . = (d, T 1 , y.. T ,) due to restrictions imposed on the admis- 
N-l IN — 1 JN~1 IN-1 

sible space D^( Similarly the function (X N_1 ) can be denoted 

as Xjj_i). The dependence of pj^_-j_(*) on X^_^ again stems 

from the restrictions imposed by X^_^ on the admissible decision space. 
Consider the relation (4-4) for n = N- 2: 
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(4-16) 


p N- 2 ( * N " 2) = ltlf E { W(N-l,cr N _ 1 ) + pN^x*- 1 ) | X N " 2 } 
6 N-1 

Using the relation 

N , V N-F N / v . 

P N-1 (X * P N-1 ( N-l* X N-1 ) 


equation (4-16) can be rewritten as 

p N (X N-2 ) = inf E {W(N-l,<x ) + p N (tt X )|X N ~ 2 } 

N-2 5 N-1 ^ 1 N-l ^ 1 ^ 1 

= inf { E[W(N-1,C7 N _ 1 ) ]x N " 2 , d N _ x ] (4-17) 

d N-l 

+ E 'v/’N-rW* 1 '' 2 ' <w > 

For every fixed d^_j e E n-1^ * n-2^ j first term in (4-17) can be shown 

to depend only on X N _ 2 and 7r N _ 2 as in (4-15). The second term in (4-17) 
can be treated as follows: 


E 1 P N-l (7rN_1, X N" l) IXN ’ d N-l ] 

= / P N IVrVl' ^ )dP v (nlX 1 *- 2 ,^) (4-18) 

N -i LN w y N-l L 


By assumptions B-C, the process tt has been shown to be a Markov 
diffusion process in Chapter 3. Thus the evolution, of the conditional prob- 
ability in (4-18) depends only on ,r N _ 2 and ■^n-i-’ and does not depend 

on the history X^ Furthermore the probability P(') can be evaluated as 
(4-7) 


( V 


y N-l 


,N-2 


N-l y 


(equation continued) 
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* 1 V. ^ • * N ’ 2, dN - 1> d Vl""*"’*' W 

‘ /p y.,, < * l ^N-i >dp 9„, ( ' 3| * N " 2 - d N-i> 


N-2 


(4-19) 


y N-l 


N-l 


and 
P 


• H .1 <fJ uN " 2, ^ = ; Vi 03 15 ’ "»-!> ■ d p v 2 (? lxN ‘ 2 ' w 


(4-20) 


Reasoning as before, (4-18) can be considered to be dependent on 7r N _ 2 * 

E ip” /Vl^N-l' '* N ‘ 2 - d N-ll 


^ p N -l l ’ r N- 1,d N- 1 ' ' d P y N _i (r,i *N-2- d N-l ) 


(4-21) 


Since the evolution of the probability ^ depends only on 7r g and 
X N _^, thus for each fixed value of d the expressions m the braces 
of ( 47 I 7 ) depend on only ^_2 and Consequently the optimal deci- 

sion function at the instant N-l depends only on 7r^_g and g, and more- 
over. 


P 


N 

N-2 


(X N ~ 2 ) = p N 


N-2 


(7r N-2 J *N-2* 


(4-22) 


The conclusion can be similarly shown to be true for n = N- 3, ... up to 
the final step: 

P N =inf E {W(l,<r ) + P^ (ir 1 ,X 1 ) ! w Q } (4-23) 

d l 

where tTq is the a priori probability that a degradation has occurred, p ^ 
gives the minimum possible risk in (4-2). 

The following general statement has thus been proved. If the conditions 
of assumptions A-D are satisfied, then the optimal decision function 6 at 

H -t Jjl 

the time instant n(n g 1) depends on the past history X n = (y n , d U ) 
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only through the posterior probability and X In other words 

and X n-1 are the sufficient statistics for determination of the optimal de- 
cision function. 

Consider the case of a sequential decision process such that the ob- 
servations can be terminated before the end of the time N allotted for ob- 
servation, and assume that the unobservable process is not a controlled 
process. The decision procedure at any instant n consists of two compo- 
nents: a stopping rule for the choice to continue or to terminate the pro- 
cess of observations, and a decision rule at the termination stage. The 

decision space for this case can be defined as D = U d\ where is 

1 

the decision space at the termination stage and D is the decision space 
for continuation of observations. If a decision has been made at n to con- 
tinue the observation, then the decision space D = U D'*' . The decision 
0 n 

space D n = D if a decision is made to terminate the process of observa- 
tions. In both cases the previous observations and decisions already made 
impose no restrictions on the subsequent admissible decision spaces. 

Since the dependence of the risk function P n ( ?r n ^ X ) on X^ is only through 

the restrictions imposed by X on the admissible decision space, the risk 
N n 

function p n X n ) depends thus only on the posterior probability and 
not on the statistic X , and the sufficient statistic is reduced only to the 
posterior probability. The minimum risk can be represented by 

p“(’n)“ M “tp“o<V- Vl<V>l (4 - 24) 

where Ait ) is the minimal risk obtained for the terminal decision taken 
n , u n 

at the moment n, and p (ir ) is the minimal risk for continuation of obser- 

^n,l n 

vations. Since the unobservable process in the sequential decision process 
is independent of the decision function, the posterior probability n is re- 
defined as 


* n (j3 ) = P e (01 y n ) 

n 

where y 11 is the past history of the observed process { y} . 

It is convenient to represent the total loss function in this case as 
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follows: 


W(n,0 n ) 


n-1 

S W(i,<7.) + K(n,cr ) nil 
i=0 1 n > 


(4-25) 


6 i + l 


For notational convenience, W(i,<7.) is defined as W (i, a.) where 

1 li 

6 . , n eD . It is interpreted as a loss for a certain step I e [ l, l+l). 

‘ °n+l o 

Here K(n, is defined as W (n,T n ) wlaere 5 n +l 6 • it is defined 

as the terminal loss if the observations are discontinued at the instant 


of time n. 


N 


The minimal risk p n ^( 7r n ) for continuation of observations satisfies 
the recurrent relation: 


N /- % - 
P 1 (7T ) - 
p n,l n 


inf , 
d n-KL eD 


E {W(n,o n ) 


+ 


N 

p n+l,l 


(,r n-H> 


TT } 

n 


(4-26) 


where Jr ^ is the posterior probability of after a new observation has 
taken. 

The minimal risk „(7r ) obtained for the terminal decision taken 
K n, O' n 

at the moment n is 


Pn,0 (!r n )= , inf n 0 E l K(l W 
a k e U 


7 r I 

n J 


(4-27) 


where the terminal decision d^ is taken over all possible classes to mini- 
mize the terminal cost. In the discrete case where the terminal decision 
space consists of m finite decision classes {/3 i = l, . .., m. Then 
(4-27) can be expressed as 


”a,0<V = 


inf f 


m 
{ S 
k=l 


K(j3 x , d k ) 


. n J 


(4-28) 


where K(/3., d, ) is the terminal cost from taking the decision d, when the 
l k k - K 

true class is /3., and jt represents the posterior probability of the class 

0 k- 


C \ ^ 
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The explicit dependence on time n will be discussed. In general, 
if the loss function W(n, cr n ) at each time step is a constant c; if the 9 - 
process is a homogeneous Markov process, and if the y^-process is a 
conditionally independent sequence of identical distribution (up to and 
after the degradation), then the sufficient statistics will not explicitly 
depend on time n. However, for the decision process truncated at stage 
N the risk function is time-variant such that the optimal decision bound- 
ary for the statistic 7 for termination of observations will in general 
depend on time n. For the untruncated decision process with N -*• oc the 
minimal risk (tt ) for continuation of observations can be shown to 
converge to a limit p. (tt ). The proof is based on the boundedness and 
monotonicity property of the {p^ ^ } sequence, and can be found in ref- 
erences (Blackwell ^ 9 \ DeGroot^ 6 ^). The optimal risk function p(7r n ) in 
this case satisfies 

p(7r n ) = Min {p 0 <7r n ), Pl (7r n )> (4-29) 

where (4-26) and (4-27) can be written as 

p (it > = inf Q E [K(cr ) | ] (4-30) 

0 n d k eD U n n 


and 


n'V” “ E { c + pi (: V i > 1 r J 


d n+l e D 


(4-30) 


An intuitive interpretation of the asymptotic limit is the follow- 
ing. Consider the truncation time N as the starting time, and p^ ^ ^(') 
as the initial condition for a backward propagation of the risk function. 

As N -*■ oc, the transient due to the "initial condition." dies down, and the 
asymptotic interval grows without bound. In other words, the finite 
"transient interval" tends to infinity and the "steady- state interval" oc- 
cupies all finite time. 

The conclusion regarding sufficient statistics will be different if the 
loss function is non-additive. Shiryaev^^ discussed the problem of 
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Markov sufficient statistics in non-additive Bayes decision problems, 
and derived new sufficient statistics other than the posterior probabil- 
ity. 

4. 3 The Optimal Detection System 

In this section the mam results of an optimum formulation of a 

/g4> /^3\ 

quickest detection problem by Shiryaew ; (also Stratonovictr ') will 
be presented, and the problems of the optimum system implementation 
will be discussed. 

Shiryaev considered the following problem. Suppose the observed 
process y(t), t § 0 satisfies the stochastic differential equation 

dy(t) = a(t - 0) • dt + crdw(t) (4-31) 

where w(t) is a standard Wiener process, w(0) = 0, 

E(Aw) = 0, E(Aw) 2 =At 


and 


a( T ) = 


a^ t = 0 (denoted as state H^) 
0 t < 0 (denoted as state Hq) 


The moment 0,, signaling the appearance of a degradation, is unknown. 
It is required to derive an optimum system of detecting the arrival of a 
degradation in the observed stochastic process y(t), as quickly as possi- 
ble subject to a limitation on the number of false alarms. 

A simple model of the process 0(t) is considered in both discrete and 
continuous cases. In the discrete case, the degradation occurs at a dis- 
crete instant 0 according to a geometrical distribution with parameter p. 


P(0 = 0) = 7T 

P(0 = n|0 > 0) = p(l - p) 11 " 1 


n g 1 


(4-32) 


In the continuous time version of the model, the a priori distribution 
of 0 is exponential: 
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P(0 >t) = e" qt (t S 0) (4-33) 

with the given constant rate q. The exponential distributions { geometri- 
cal distributions in discrete-time case) have the useful property that 

P(0 >t + s|0>s) = P(0 > t [ 0 > 0) 

In both cases it may be decided at any instant t to make a decision 
about the system mode. This theory assumes that the correctness of 
this decision can then somehow be determined. This is to be done by 
the identification process described in Chapter 5. Then, if it is found 
that 0 < t the detection process terminates; but observation must be re- 
sumed immediately after a false alarm. For the case of exponential 
and geometrical distributions of 0, the posterior distribution of 0 at any 
time does not depend on anything which took place before the last false 
alarm. Then, after carrying out one stage of the observation (up to the 
stopping time v at which a disorder is detected) and after ascertaining 
that it' is necessary to continue the observation, one has the same situ- 
ation (in the sense of the distribution for the unknown parameter 6) as 
before the observation began, the only difference being that one errone- 
ous signal has been produced. 

The optimum observation method will first be constructed for pro- 
cesses with discrete time as the Bayesian solution to problems involving 
the minimization of specified risks. The method of constructing Bayes 
solutions is based on the use of a recurrent equation for the risk, and it 
turns out that the Bayes solution is based on the posterior probability. 
This is in fact an application of the general result on the sufficient sta- 
tistic developed in the section 4. 2. 

Define a risk function associated with a given a prion probability tt 
and each decision function §: 

R(tt, 6) = E,W(v,0) 

o 

Let the non-negative loss function W(t, s) be such that 
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t < s 

W(t ’ s) = c 2 • (t-s) t g s (4-34) 

where and c 2 are specified positive constants, and W(0, s) < cc for s < ce. 
The constants can be viewed as penalties, with c^ as the terminal penalty 
for false alarms, and c 2 the penalty for delay in detecting a degradation: 

Let D = {D^j D 1 } be the set of all decision functions with the set 

1 

of terminal decisions about the presence of degradation, D the set of con- 
tinuation of observation. Define 

p(?r) = inf R(S, tt) 

SeD 

p (tt) = inf R(6 ,tt) (4-35) 

SeD 


Then 


Pl (7r) = inf R(s, 7r ) 

SeD 1 

p(7T) = Mm (p 0 <vr ), Pl (7r)) 


Define 

P(6 = n ,y . ' •••» y n > (4-36) 

where y^, . . . , y^ are observations taken up to the instant n. It is to be 
noted that the measurement actually observed is not the function y(t) de- 
fined m the stochastic equations (3-26) and (3-30), but rather some ap- 
proximation to the "function" y(t). Thus in the discrete time case the 
notation {y^} stands for the measurement residuals (r(n)} defined m 
(3-39) and (3-40), and represents a sequence of conditionally independent 
random variables. However the posterior probability (4-36) can be de- 
fined to be conditioned either on the sequence {r(n)} or on their summa- 
tion (equivalently on the integral y(t) in the continuous time case), since 
both generate the same Borel-field. For notational convenience the 
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"generalized" notation y will be employed to denote the observation through- 
out the rest of the thesis without further explanation. 

Shiryaev proved the following statement. The Bayes solution to the 
problem of minimizing the risk consists in observing the process ?r(n) 

( 7 r (0) = 7r ) until the first moment v at which w(v) S ir', where 1 r 1 is the root 
of the equation p Q (7r ') = '). 

The optimal risk function p(7r n ) satisfies 

P (7T n ) = Min [pg^), Pi/%)] 

where the terminal risk Pg( 7r n ) can be derived from (4-34) and (4-28) in 
section 4. 2: 

pjir ) = (l-7r ) c. (4-37) 

"O' n n 1 

The risk P^( 7r n ) satisfies the recurrent relation^^: 

= 4 Pl ( Vl (Tj)) dP y (7?I V + c 2 7T n ( 4 - 38) 

where P^(rjl 7r tl ) denotes the probability that yS ;? conditioned on the poste- 
rior probability 77 ^ (the random variable 7r n represents in turn the a prion 
probability for the observation y at instant (n+1) ). An explicit expression 
for the term (rj 1 77 ^) and the probability 7 r = 77 n+ ^(rj) can de -^ oun<4 ^ter 

in equations (4-73) and (4-72). 

Shiryaev studied the optimal observation method in a different version. 
Among all decision rules with a specified probability of false alarm 
a = PAr < 0), find an optimal rule 6* which gives the minimum mean time 
of delay for detection of a disorder: 

Jv- e I v g e ) - mf E (i/-0 | 1 / £ 0) (4-39) 

6 ~ 5€D 6 

a 

This problem can be transformed into the Bayes problem by defining 
the risk function. 

R(6, 7 r,c 1 )= c 1 P(j/< e ) + c 2 e(j/-0 I v g 0) (4-40) 
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where the loss function is now defined as 


t < s 
t > s 


W(t, s) 


c 2 (t-s) 


(4-41) 


The difference is that the constant c ^ is now the Lagrange multiplier, 
which is to be evaluated through the specified constraint a , i. e. c^ = c^(a ). 

For this choice of the risk, set 


p (ir, c- ) = inf R(6, 7r, c 1 ) 

6eD a 

P (tt, c )= inf R(S, ?r, c. ) = c (1-n) 
u 1 5eD° 

P-, (7 r, c ) = inf R(6, it, c 1 ) 

1 fieD 1 i 

The optimum decision rule in this case can be stated as follows. For any 
a - P( v < 0) the optimum method in the problem of minimizing the mean 
time of delay in detecting degradation t - r(a) consists in observing the 
process n ^ until a certain level it 1 = tt '(or), 0 S it 1 % 1 is attained for the 
first time. 

The fact that the posterior probability it = P( 0 S n | Yy.-.-. Y n ) is the 
only sufficient statistic for the optimum detection system can be viewed as 
an application of the general result of section 4. 2. The loss function in 
(4-25) corresponds to: 

l 

W(i, a.) = c 9 for all i 

1 Ci 

K(tl ’ ff n )=c l 

The loss function satisfies the additive property, and the loss over each 
step is a constant. The assumptions B-D in section 4. 2 are obviously 
satisfied. Furthermore the distribution of 6 is a homogeneous Markov 
process, i. e. P(0 = n+1 | 6 > n) * P(a . - = a^ | a^ = 0) = p for any n. Since 
only the untruncated decision rule with N = oe is considered, the sufficient 
statistic therefore can be simplified to the posterior probability it . 
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Similar results are formally extended to the continuous time case by 
Shiryaev. Some exact formulae and explicit calculations have been dis- 
cussed^ 68 ^. (Also Stratonovicbf 7 ^). In chapter 3 it was shown that 
7 r = 7r (t) is a diffusion process which satisfies the stochastic equation* 

2 

9. Q, 

dir t = (1-ir ) • (q |*?r 8 )dt + —j ^(1-^) dy t 

L ' <7 o 


and 

dy^. = a(t-0) * dt + a d w^ (4-42) 

From (4-42) the following statistics can be computed: 

E[ d7r 1 7r t ) = q(l-7r t )dt 

? a? ? (4-43) 

E[ d7T ] - — 2 " [ ’(l -7 ^)] dt + 0(dt) 

<7 

At any moment of time one can either stop or continue the observation, 

p (7r t ) = Mm [P 0 (^ t )> P 1 ( ff t )] 

where pg(7r^) is the optimum risk in terminating the observation at time 
t if ff(t) — 7T and p^(7r^) is the risk from continuing the observation at 
time t when 7r(t) = tt In the continuation region, where observations 
are made, denote P-j^ 77 ^) = p^)* The discrete recurrence relation (4-38) 
can be expanded to 


p (jr(t)) = Cg * ff(t) * At + / p (7r(t) + A7T ) • d P(A7T | 7r(t)) + 0 (At) (4-44) 


Considering the class of decision rules for which a bounded second deriva- 
tive p" (ir^) exists and using (4-43) it can be shown that p(7r satisfies 
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the differential equation: 


a 2 

—7!^ (1-TT ^ 2 p"{7T t ) + q(l-7T t ) p’(7T t ) = -C^ (4-45) 

2a 2 


in the continuation region ir ^ e [ 0 , 7 ] , where 7 is the boundary to be de- 
termined. Since the risk function is not an explicit function of time for 
untruncated processes, the equation of p( 7 r^.) is m fact reduced to an or- 
dinary differential equation. The optimal detection system thus consists 
in observing the process tt (or tt ^ m the discrete time case) and is 
reduced to the determination of the optimal boundary for the decision to 
stop or continue the observation at a minimum risk. For on-line me - 
chanization of the optimal detection system, only the stochastic differen- 
tial equation ir ^ need be implemented and the optimal boundary for ter- 
mination of observations need be determined. These problems are now 
to be discussed. 


In general, calculation of the optimal boundary 7 involves solution 
of the recursive backward equation (4-45) of the minimum risk. The 
boundary conditions for optimal decision are established by the following 
relations* 


P 0 ( 'V 

> ‘ 

dp 0 

, d Pi 

d 7 T 

7 dir 

dp 1 

=0 

d TT 

O 

II 


(4-46) 


The three boundary conditions are sufficient to find the expression 
for the risk function p^ ( 7 r ) in the region [ 0 , 7 ] and the unknown boundary 
7 . The first boundary condition is obvious, and the third boundary con- 
dition is derived from the assumption concerning the boundedness^ of the 
second derivative p" (^^.)- The derivation of the second condition will be 
discussed m section 4. 4. 2. 
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In the version (4-39) of the optimal detection system the optimal 
boundary y can be solved by the optimal boundary conditions (4-46). 

The additional unknown Lagrange multiplier c^ associated with the ter- 
minal cost Pq (7r^,) will have to be removed by the specified constraint 
on false alarms. However a simple relation for the boundary y m terms 
of the constraint of the specified false alarm error probability a can be 
solved by application of the theory of the first passage problem of diffu- 
sion processes. The result can be stated as follows. 

y=l -a (l-») (4-47) 

where n is the a priori probability of degradation defined m(4-32). It 
is noted that y does not depend on the parameters a^, a and q. The 
derivation is based on the straight-forward solution to the Kolmogorov 
backward equation associated with the diffusion process of the posterior 
probability 7 r defined in the normal mode. The detailed solution can be 
found m Shiryaev' 1 or Stratonovicm . The theory of first passage 
times of diffusion processes will be discussed in detail in section 4. 5 
for derivation of the detection performance. 

The on-line implementation of the Ito stochastic differential equation 
7 r (t) will now be discussed The Ito integral has certain special proper- 
ties which derive basically from the fact that the increments dw(t) of a 
Wiener process are of the order of (dt) 1 ^ 2 and not of (dt) as they would 
be for a smoothed random process This means that second-order terms 
(dw) 2 cannot be neglected in the Ito calculus 

Consider the differential equation (4-42), of the Ito stochastic process 

jt (t) 

2 

cl cl 

d7T = (1-* ) • (q |-3r 2 ) dt + — |-7r t (l-jr t ) dy t 

cr a 

and the observed process y(t) is defined as (4-42) 

dy^ = a dt + a dw^ 
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The process y{t) is a nondifferentiable process with probability 
1, and it represents an idealization of the actual process y H (t). In 
a practical system the observed process is not the process y(t), but 
some smoothed band-limited process y^-Ct) which passes the frequen- 
cies [0,H] only. The derivative of y^ exists with probability 1, and 
satisfies the equation 

y R (t) = a+ w H (t) (4-48) 

Consider the following terms 

t 

I (t) = / tt{s) [1- ?r(s)3 dy(s) (4-49) 

0 


and 

t 

Ijj(t) = / ^ H (s) [l-7r H (s)} dy H (s) (4-50) 

It is natural to expect that for H -*• oo, the value of Ijj(t) converges 
to I(t) However, it is well known that such convergence will not exist 
The reason is that for the smoothed process w H (t) 

m-1 „ 

““ i?0 [W H (S W ) ' W H ,S 1 )I ■ 0 

for 0 = Sq < . . . < s m „^ = t, while for the limiting process w(t), this 

limit is t Thus the equation (4-49) in the Ito form does not have a 
physical analogy and cannot be implemented on-line 

The problem of modeling ordinary scalar differential equations by 
stochastic differential equations is discussed by Wong and Zakai^^ 

Let the realizable process y^Ct) be defined by 

y H (t) = + ^ (4-si) 
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Then the process y ft) can be modeled by the Ito stochastic differential 

H 

equation 


dy(t) = [f(y(t), t) + i 9g(y(t), ^ ) g(y(t),t)]dt + g(y(t),t) dw(t) (4-52) 
^ 9y(t) 


in the sense that 

inn y w (t) ~ y(t) 
H-*°0 H 


whenever 

t 

lim [w H (t) = / , ^ H (s)ds] = w(t) 
H-*-oo 0 


Thus the stochastic equation (4-42) for on-line implementation 
should be replaced by the following equation- 

t a 2 

TTj^t) = JT(0) + / [1 - JT H (s)] [q -4-4(8)] dS 
0 , ^ 

2 X 
cl "t 

- \ — Y / [1- 2?r (s)] • tt h (s) [1-jt (s)] ds (4-53) 

, a 0 

a- t 

+ — 2 / ^ H (s) (1-tt h (s)) y H (s) ds 

cr 0 

where y-^s) is the actual band-limited observed process 

^ H (s) = a + a w H (s) 

The equation (4-53) can be handled as an ordinary differential 
equation This subtle point, which must be taken into account when 
implementing ir (t) can be avoided if an equivalent process A(t) is im- 
plemented in place of ?r(t) * 
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X(t) = inf " (t . } . 

.l-ff(t). 


From (3-82) the process A(t) satisfies the integral equation- 

2 


A(t) - X(0) 4 — / dy(s) + / (q(l+e ^ S ^) ^-)ds 


t 

r 

0 


t 

r 

0 


a . 

c 

£ 

a 


(4-54) 


In comparison with the formula for 7r(t) it is noted that in the process 
X(t) there are no multiplicative factors of a diffusion nature associated 
with the term dy(t). The stochastic integral can be computed in the same 
manner as for the smoothed process. Define 


A-^t) = in ! 


T H (t> 

l-7T H (t) 


Then the process >^.(t) defined as 


t , t -JUts) 

Xgtt) = X(0) +~f y H (s) ds + J q [ q(l+e W ) 


2 - 
1 a l 

- 2 2 ^ dS 


(4-55) 


will be the correct model for implementation of the Ito stochastic differ- 
ential equation (4-54). The above equation for X(t) is simpler, but there 
exists still a non-linear element for implementation In the next sec- 
tion a class of linear system formulations will be discussed. 

4 4 A Suboptimal Detection System 

4 4. 1 A Linear Detection System With Uncertainty Feedback 

A study of the stochastic differential equation A(t) (4-54) re- 
veals that the process X(t) will be transformed to a linear system for q - 0 

dX(t) = — |-[dy-^-a 1 * dt] (4-56) 

a 


109 .. 



Physically this means that the system is defined to be operated in an 
unknown yet unchanged mode There is no transition between operating 
modes during the mission, and the problem is reduced to that of dis- 
crimination between simple hypotheses This is the formulation of the 

( 22 ) 

well-known Wald's sequential analysis in the continuous -time case 
The Wald optimal decision rule can be simply described as follows. 

Let the observed process be defined as 

dy(t) = a dt + o dw(t) (4-57) 

Let a and J3 denote the probabilities of false decisions under the hypothe- 
ses H Q (a=0) and (a^a^ Then one observes the likelihood ratio func- 
tion X(t) (4-56) with X(0) = 0, until the first realization of one of the equali- 
ties X(t) = A or X(t) = B, where the boundaries A and B (B > 0i? A) are 
defined by the error probabilities 

A=fa (rt) 

(4-58) 

In the first case one accepts the hypothesis H Q , and in the second, 
the hypothesis A sketch of the sequential probability ratio test is 

illustrated in Fig 4 1. It is well known that the Wald SPRT has the opti- 
mal property of being the test that, on average, requires the minimum 
number of samples of all tests to come to a decision for some specified 
error probabilities Wald's sequential analysis will be discussed in de- 
tail in Chapter 5 

The distinct feature of a transition of operating modes in the degra- 
tion process distinguishes the problem from Wald's formulation. In 
general, Wald's sequential procedure is a single stage decision in the 
sense that the process terminates whenever acceptance of either class 
Hq or has been decided However in the detection system one is 
only interested in detecting the degradation as it occurs As will be 
shown in section 4 5, the definitions of stages and the error probabili- 
ties are different in these two formulations In order to use Wald's 
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B = in 


A '-In 


Fig. 4.1. Sequential Probability Ratio Test. 

procedures for the detection system, a modification is proposed 
(Shiryaev^^): When the lower boundary A < 0 is first reached, the 
observation process X(t) starts anew. This is equivalent to having 
a reset of the process from the boundary A to zero. 

It will be shown however, that the detection performance of this 
modified Wald's procedure will suffer an extra time delay in detection 
in comparison with the optimal system. Consider the case when the 
system is operated in the normal mode with a = 0 In this case equa- 
tion (4-57) can be represented by 

dy(t) = a dw(t) (4-59) 

Substituting (4-59) into (4-56) produces 

2 

ct Si 

dX(t) - a-dt + — -dw(t) (4-60) 

2a a 

It can be seen that the process X(t) for the system defined in the normal 
mode Hq will show a negative average drift rate. Similarly the process 
X(t) will show a positive average drift rate when the system is defined in 
the degraded mode H^- 
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2 

cl cl 

dX(t) = — dt - 5 - — — dw(t) 

2u ° 

Suppose a degradation occurs at t = 0, as shown in Fig 4 2, then 
the likelihood ratio function X(t) will take an extra time delay t, to com- 
pensate for the negative quantity of the likelihood ratio function accumu- 
lated before the occurrence of the degradation. Thus the linear detection 
system must be modified in order to give a better performance, and it 
will be shown how this can be done by the feedback of an uncertainty in- 
formation of the operating system. 

The motivation of the uncertainty information feedback can be pre- 
sented as follows The posterior probability function 7r(t) and the like- 
lihood ratio function X(t) have their physical meaning By definition they 
indicate the current state of uncertainty concerning the system being 
operated in mode or conditioned on the history of the available 
observations. From the a priori information it is known that the system 
is initially operated in the normal mode and characterized by a unique 
transition from the normal mode to the degraded mode at an unknown 
instant Thus one is mainly interested in detecting the degradation mode 
rather than the verification of the normal mode It has been shown that 
the detection process will take an extra time delay in detecting a degra- 
dation if both decisions are to be made. Now if the uncertainty informa- 
tion is fed back in some optimal way, such that the decision works in 
cooperation with the available a priori information, instead of working 
in complete ignorance of what the current state of information is, as it 
does in the absence of uncertainty feedback, one expects that the system 
will improve the performance of detection This is the motivation of un- 
certainty information feedback. Since there is a deterministic relation 
between A(t) and 7r(t), l e , X(t) =/ j£n [^ - j , either A(t) or ?r(t) will be 
used as the uncertainty information whenever appropriate 
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Fig.. 4. 2. Illustration of Extra Delay for Detection in SPRT. 


4. 4. 2 Some Bayesian Results of Wald's Sequential Analysis 

In this section Bayes solutions of Wald's sequential analysis 
of testing two hypotheses will be discussed Let a be and unknown con- 
stant assuming two values 0 (denoted as state H Q ) and a.^ ( denoted as 
state H 1 ) with a priori probability (1 -jt) and i r respectively Suppose that 
the random process y(t) = y^. is observed such that 

dy(t) = a dt + cr d w(t) (4-62) 

where w(t) is the standard Wiener process The problem considered 
consists of testing between two hypotheses Hq against from the results 
of observations of the process y*" = {y(s), 0 < s = t} 

Let (f?,^, P)be the probability space, and u) be the element of the set 
generated by the random process y* The probability distribution genera- 
ted by the process y* and based on the a priori probability tt is represented 

by 
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(4-63) 


Pjj. ( U ) = 7T • P 1 (U ) + (1- 7T ) • P Q ( tO ) 

Here P.(-) denotes the conditional probability generated by the process 
t 

y conditioned on state H^, 1 . e. , 

P.(u) = P(u lH) i=0,l 

It is to be noted that P.fo) represents the joint distribution. In par- 
ticular, for the discrete case P (to) denotes the conditional joint distribu- 
tion of the observations {y x , y 2 , , y t > generated by the random pro- 

cess {y(s), 0 < s g t} . 

Let 5 (u) = {t(u), d(u»)} be some decision function from the results 
of observations on the process {y(s), s = t} . Here t(w) is the random 
stopping time for terminating the observation, d(to) is the terminal deci- 
sion rule The terminal decision space consists of two points dg and 
d^. The decision d. will be interpreted as the decision that the hypothe- 
ses H. is accepted Let a(6) = P{d(to) = d^ iH Q } and |3(6) = P{d(u) - d Q IH^} 
be the false and miss alarm error probabilities at the terminal decision 
for the decision function 5 (to). Then the risk function associated with 6 (to) 
is defined as 

R ( 6 , 7r ) = 7 t{cEj(t) + a • j 3 ( 5 ) } + (l-7r) {cEg(T) + b*of(6)} (4-64 ) 

Notice that -R(6, ir) denotes the total expected cost starting from the initial 
state 7r to the terminal decision based on the decision function 5. For the 
case of specified a and 0, a and b are the Lagrange multipliers. The co- 
efficient c is the sampling cost per unit time. The decision 6* is called 
Bayesian if 

' 

R ( dT , 7T ) = inf R ( 6 , ir) for all 0 = 7r = 1 (4- 6 5) 

6 

where R(S*, it) is the Bayesian risk 

Let 7T. = P{a = a 1 I y(s), 0 < s ^ t} be the posterior probability that 
H^ is true based on observations y Define the function 

( K(7t) = Min {a tt , b (l-7r)} 
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Then it is shown in Appendix A that the risk function (4-64) can be ex- 
pressed as: 

R( 6^)| E^Jct + K(jt t )] (4-66) 

In other words,, K(?r ) is the minimum risk from an immediate 

T 

decision without further observations when the posterior probability 

of Hh is it . It will be assumed that whenever a terminal decision in 
0 1 T 

D is chosen after sampling has been terminated, that decision is an 
optimal decision against the posterior distribution of Hence m 

discussion of the sequential procedure, the optimal decision 5* in 
essence is reduced to determination of optimal stopping time r*, after 
which the terminal decision d* can be easily defined by: 

d x if a tt t * S b (1 -tt t *> 

d Q if a tt t * < b (1-7^*) (4-67) 

In this case the risk function (4-64) to be optimized can be written as 
R (tt , 6 ) = [cr + K(tt t )] (4-68) 


d*(0) = 


The risk function is thus reduced to the same form discussed in 
section 4.2, If only untruncated case is considered, the basic recurrent 
equation (4-29) for the optimal risk function in the discrete time case 
can be explicitly expressed with some notational changes as follows" 

p(7T n ) = Min [,K(7T n ), F(7T n )3 
where the optimal terminal risk K( ?r n ) is defined as 

K( 7 r n ) = Min [a 7 T n , b(l-7r n ) ] (4-69) 

and the risk for continuation F(7r n ) satisfies 

(4-70) 

F(7r n ) = c + E , fF(7r n+1 ) I 7r n ] 


115 



The expectation operator is defined as, 


cc 


E [F(ir n+1 ) 1 7T n ] = f F(7T„ +1 (r?)) P„(U kjdrj (4-71) 


■oe 


n+1 


n 


where the posterior probability operator based on the observation y 
is expressed as : 


v n+l (y) 


% ' p i (y > 


(1-7r n )p 0 (y) + w n ' p l^ y) 


(4-72) 


and the probability density p(y I rr ^) against the posterior probability 

5r can be written as: 
n 

P(y I* n > = (l-%> P 0 <y) + P x ( y ) (4-73) 


where Pg(y) and p^(y) are the probability density function of the obser- 
vation y conditioned on Hq and respectively. 

The exact solution of the recurrent equations (4-70) in the discrete- 

/ oe \ 

time case is analytically difficult to solve. Whittle v 1 discussed the 
general solution form for the problem of discriminating among m hy- 
potheses with the function KXtt^) assumed to be a linear function of 7r^. 
However the explicit solution is not tractable. For the special case of 
discrimination between two simple hypotheses and with the approxima- 
tion of neglecting overshoot of F(7r n ) at the final decision boundary. 
Whittle derived the general solution form as follows: 


F (7T ) = ljj 1 * 7T + ijj • (1-7T ) 

n Y 1 n T 2 ' n 


c 7T 1—7 r 

+ S_ i n ( El) + 


C(1-7T ) IT 


a- 


10 


7 r 

n 


a 


01 


n 


(4-74) 


where ij' ^ and ip 2 are coefficients to be determined by the boundary 
conditions. The coefficients a . are known as measures of discrimini- 
natory information and defined as: 
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(4-75) 


* p,-(y) 

* S In (— • ) P-(y) dy * 

13 ' -oo p (y) 1 

J 

The solution (4-74) is identical with that which would be obtained by 
determining approximate values of the probabilities of error and ex- 
pected sample size from Wald's identity m the usual manner (Wald^^). 

The solution of the recurrent equation in the continuous-time case 
will be derived in detail. The original discrete-time problem is ap- 
proximated by a continuous-time diffusion process* in which one is 
essentially making use of the first and second derivatives of the risk 
function. Only the case of "two hypotheses will be considered* the 
general solution form for the untruncated procedure in the continuous 
system can be simply derived by the solution of an ordinary differen- 
tial equation. 

It was shown m Chapter 3 that for the class of linear detection 
systems jr(t) satisfies the stochastic equation: 

2 

a l 2 a l 

d?r t = 2 ~ < 1_7I V‘ dt + — T \ (1 “V dy t (4-76) 

a a 


where a Q is assumed to be zero. It was shown in (3-74) that a standard 
Wiener process exists such that: 



= a dt + a dw^ = a^ 


dt + cr dw^ 


(4-77) 


Then the process 7r(t) satisfies the stochastic differential equation: 




*t (1 ' 7r t > dw t 


(4 r 78) 


This easily yields 
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E [ d7T | ir ] = o 


(4-79) 


E [(d?r) 2 | it t j = 


(7 


2 "t 


Cl 11 

IT. (1 




dt 


The risk F(7rp for the untruncated case in the region D of con- 
tinuation of observations satisfies the relation 


F(t r ) = c • At + E { F(tt + Ajt)} + 0(At) 

t 7T t t 


(4-80) 


9 F 9 2 F 

Assuming that the derivatives -j^-and 0 exist and expanding 

c)7T Ott 2 

F(tt + At r) into a Taylor series, it can be shown that 


{F<* t + A* ) } = F (T t ) + -§£ (l t ) (A* ) 


2 

+ (&K) 2 + 

2 9t rf 1 ff t 


(4-81) 


Using equation (4-79) produces 


2^ 

E {F(7T t + Att) } = F(7T t ) (l-5T t ) 


9 2 F(TT t ) 

3tt 2 


At + 0(At) 


(4-82) 


Substituting this expression into equation (4-80) and passing to the limit 
as At 0 gives: 


1. 

2 



2 ... s 2 
T t 



c = 0 


(4-83) 
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The recurrent equation for the untruncated case is thus simplified 
to an ordinary differential equation. The general solution of (4-83) 
can be obtained as 




F <V = 717 rx + +3 • \ + *4 < 4 - 84 > 

(a^/20- ) t 


A simple manipulation of (4-84) gives 


c 7T c 1-7T, 

F( 7 T ) = — 5 y ( 1 - 7 T ) in (-, ) + — 5 jr £n( — ) 

t <a*/20 t l-;r t (a"/20 t ^ 


+ ’ (l“7T t ) 


(4-85) 


The density functions p^y) and P Q (y) m (4-75) are the normal 
distributions N(a^, o) and N(0, o) respectively, since the observed 
processes are modeledas Gaussian. Then a simple computation of 
(4-7 5) shows the result: 


“01 " a io = (a i l2(j2) 


(4-86) 


Thus a completely analogous form of the general solution F (tt-^) can 
be obtained in the discrete and continuous time cases. 


It is instructive to study the solution form F(tt^). The solution con- 
sists of two parts. Mathematically the two terms associated with the coef- 
ficient c are the particular solution, and the other two terms are the 
homogeneous solution. Denote the notation J( 7 r.j.) as: 


J(0 = „ 
t a 


10 


in(~ 


1-7 b 


7T . 


7T , 


+ 


a 


01 


' 1 -’ r t ) fc( -n? ; 


-) (4-87) 


The term - J(tt^) then represents the average of the current sam- 
pling cost of discrimination of the hypotheses. It is important to em- 
phasize the - JCtt t ) represents the sampling cost used up to the current 
state 7 r^, and does not represent the sampling cost to complete the pro- 
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cess. If the state ir ^ is the termination state, then represents 

the total sampling cost for the whole process. The facts that J<l/2) = 

0 implies that in Wald's formulation the sampling cost is referred to 
the datum 7 = 1/2. The state ir - 1/2 reflects a state of no informa- 
tion ab'out either hypothesis, and is the initial state to gam the informa- 
tion at the expense of the cost of sampling. 

The coefficients and ^ m (4-85) represent the non- negative 
weighting factor to be determined by the boundary conditions at the 
decision points. The results for determining the optimal decision 
boundary have been discussed by Chernoff, ' ' Whittle^ ' and others. 

The decision boundaries are to be chosen in such a way as to minimize 
the function F(tt^) (Fig. 4. 3). The main result states that if F possess- 
es continuous first derivatives and the boundary 7 is optimal, then 


9 F „ 8 K 
dir i 87 r. 


(i=0,l ) 


F(ff-) = K(*r.) 


(4-88) 


on the boundaries 7, at which 8K| 8 tt. exists, 
result can be found in references^ ^ ^6). 


An heuristic proof of the 


As mentioned before the optimal decision rule is characterized by 

determination of the optimal boundary, the derivation of the boundary 

thus gives the complete information about the structure of the optimal 

procedure. Let D 1 denote the region of 7r for continuing observations, 
0 

and D the region for the terminal decision, then the optimal risk 
function satisfies: 


2 2 2 d*F<7r ) 

(i-v 1 

2 2 x x ,2 

a dir. 


+ c = 0 ?r,eD 


K(tt^) = Min [§l 7 r^, b(l-7r )] 


TTfCD 


(4-89) 


The solution to the equation (4-89) has been derived in (4-85). 
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F(»r). K(tt) 



Fig. 4.3 Risk Functions of Sequential Decision Problem 

The two straight lines together represent 
K(tt), the expected risk on terminal decision. 
The two curves represent conceivable forms 
of F(tt), the expected risk for continuation of 
observations. 
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(4-90) 


F(* t ) = J(?r t ) + G(jr t ) 


where G(^ t ) = + 0 2 • ( 1 - 7 ^), and J( 7 r t ) has been defined in (4-87). 

The minimum risk at the instant t is expressed as: 

p(jr ) = Min {k ( 7 r ) , F (?r ) } 
n x n n 

n 


The optimal boundary conditions give: 

F(7.) = K(y.) 


(4-91) 


dF 
d ir 


7 i 


dK 

dir 


7,* 


1 = 0,1 


where 7 . are the two optimal boundaries for accepting hypotheses H 
(i = 0, 1) respectively. In Wald's ordinary procedure, there are two 
unknown boundaries for Hq and H^, and two unknown constants 0 ^ and 
0 2> The four unknowns are determined by the four conditions at the 
optimal boundaries (4-91). 

4. 4. 3 Formulation of a Stochastic Control Problem 

In this section it will be shown that imbedded m the detection 
system there can be defined a combined optimal control and decision 
problem. In order to motivate the formulation the system of Wald's 
sequential analysis will first be discussed. 

Consider the optimal risk function at stage n in the formula- 
tion of the sequential system: 

p(ir ) = Min {K(tt ), F(?r )} (4-92) 

n c n 11 

where 6 n is the decision for continuation or termination of observations 
'to be made at stage n. The solution form of the risk F( 7 r n ) for continua- 
tion of observations can be explicitly expressed as {(4-85) and (4-90))- 

F(?r n ) = G ^ n > + J(*n> (4-93 ) 
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where 


G( % ) = 


#1 


n 


+ 


(1 - IT ) 

n 


(4-94) 


7T 


J {?T ) = - — 

n a 


7T i’ll (— JL) . 


10 


1- 7T 


a 


n 


01 


1- 7T 

(1-7T„) in ( ) 

7 r 

n 


n 


IT 


n 


J 1 (7T n ) + (1 -*T n ) • J Q <7T n ) 


(4-95) 


It has been mentioned that the value “JC 7 ^) represents a measure of 
the expected sampling cost used up to the current state it . The sam- 
pling cost of observations gives the information necessary to reach 
a greater degree of certainty such that the probability of making an 
incorrect terminal decision is decreased. Some insight can be ob- 
tained from J(tt ) in terms of information measure. The measure 
n 

of discriminatory information a-, is defined in (4-75): 

«J 


<, . r In 

^ p-(y> 


P-l (y) d y 


. p^y) 

The coefficient represents the expected value of xn (■ - 


■) if a sin- 


3 c 

gle observation y is taken and H. is true. The coefficient^ ) can 

1 “lO 

be interpreted as a measure of the cost per unit of discrimination that 
is true. The term - J^( 7r n ) represents the sampling cost to gener- 

fi 7 r 

ate the information fn ( n ) for discrimination in favor of H. against 

1-7T 


0 

Hq. A similar interpretation can be given to (— ) and "Jg^n ^ 4 J( ff n ) 

represents the expected value of the information measure over the cur- 
rent posterior probability 7r n . For the case of sequential analysis 
where decisions on both Hq and are to be made, the neces- 

sary expected sampling cost of observations to give the information for 
discrimination of either hypothesis H. or H n . It was shown by Wald and 
Wolfowitz 1 ' that in the class of all hypothesis tests of Hq against H^ 
subjected to the same false and miss alarm error probabilities, Wald's 
sequential analysis requires, on the average, the minimum number of 
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samples of all tests to come to a decision. 

However, in the detection system one is only interested in mini- 
mizing the mean time delay m detecting a degradation. The informa- 
tion given by sampling which reduces the uncertainty for the decision 
of the mode ^ is of important value, while the information given by 
sampling for discrimination in favor of the mode Hq is of no impor- 
tance, since no decision of the normal mode is to be made in the de- 
tection system. This analysis suggests that a further improvement 
in reducing the sampling cost can be obtained in the detection system. 
The improvement is to be obtained by utilization of uncertainty infor- 
mation feedback to formulate the detection system as a combined opti- 
mal control and decision problem. 

The formulation is started by identifying the state variable, con- 
trol (decision) variable, and the risk function. It is natural to define 
the posterior probability it ^ as the state variable, since it represents 
the sufficient statistic of the detection process. The control variable 
is assumed to be an explicit function of the state variable 7 r^, 1 . e. , 

u = u<7T ). It is assumed that controls are to be executed only at the 
n n 

sampling instants n in the form of impulses with sizes denoted by u n - 
With this mechanism the control can be simply implemented in the 
form of a jump in it at each sampling instant. Moreover, the nonlin- 
ear stochastic equation of the state t e(n, n+1) during each sam- 

pling interval will not be affected. In this case the state variable it (t) 
is in fact defined to be the posterior probability based on the observa- 
tion history up to one sampling period, and at each sampling instant 
n the reset value it n is treated as a new initial condition for updating 
the it (t) - process. In general, the introduction of the control action 
into the detection system will certainly distort the evolution of the pos- 
terior probability, which is derived on the basis of updated observa- 
tions, and the physical meaning of the state it as the posterior proba- 
bility of system degradation is altered. However, an improvement of 
detection performance is made possible precisely due to the fact that 
the information value about the system in the normal mode is of no 
significance m the detection system, and distortion of the information 
for mode Hq is allowable. 
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The admissible control region will now be defined. The uncer- 
tainty information w is investigated at each sampling instant. If the 
system is operated more likely in the normal mode, i. e. , n ^ e (0, 1/2), 
then one is not interested in utilizing the information in the operating 
system for taking any decision and is willing to throw away the pre- 
viously accumulated information. In other words, the uncertainty in- 
formation is only accumulated over a duration of one sample period, 
if the system appears to be in the normal mode. On the other hand, if 
the system is more likely operated in the degraded mode, then no dis- 
tortion, 1 . e., neither "improvement" nor "degradation", of the infor- 
mation value of the system is allowed. In this case no control should 
be applied. The reduction of uncertainty information on mode must 
be derived from updated observations, and the problem is reduced to 
an optimal decision problem. With the above consideration the admis- 
sible control region U is defined as follows: 


u : u{?r ) e R , w e (0, 1/2) 

u : u(?r ) e 0 we [l/2, 1) 


where R is the entire real line, and 0 is the empty set. 


(4-96) 


Before a precise optimal risk function of the detection system as 
a combined optimal problem is formulated it is useful to study the nec- 
essary modification of the optimal risk function form in the original 
decision process and the detection system. At state n the decision 6^ 
to be made for continuation or termination of observations for the 
detection system is defined by 


Mm {Q (w ) , G(.w )} 

0 n n 
n 

= Min {K (7r n ) - J (7r n ) , G(7r n )} < 4 ~ 97 > 

n 

Two modifications of the formulation (4-92) have been made in (4-97). 
The form of the terminal risk for Wald*s sequential analysis was de- 
fined m (4-69) as: 
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a • 

"n 

for decision of Hq 

b • 

(l-» n ) 

for decision of 


Since the decision of only is made in the detection system, the ter- 
minal risk is modified to be: 

K(7r n ) = b • (l-vr n ) (4-98) 

Physically the modification can be simply realized by implementation 
of a one-sided decision boundary. 

The risk function for continuation in the detection system is for- 
mulated in the form Q(?r n ) rather than F(7f n ). The form Q(7r repre- 
sents the total risk for detection of mode (including the previous 
sampling cost -J(7r n ) up to the current state ?r ), if the decision pro- 
cess is terminated at it In comparison Ft 71 ^) represents the respec- 
tive risk, depending on n defined in the region = ( 7r n : 77 n e (°» 1/2)} 
or = { 77 n : 77 n e (1/2, 1)} , to complete the process to acceptance of 
the mode Hq or H.. . The optimal stopping boundary 7 for the one-sided 
detection process can be determined by application of the optimal bound- 
ary conditions (4-91): 


dQ 


d it 


n 


= 0 


7 


(4-99) 


and 

Q (7 ) = G (7 ) 


The condition of the optimal decision process states that it is advanta- 
geous to continue the observation process until the rate of decrease of 
the terminal risk K (7T ) and the rate of increase of the sampling cost 

- J (tt ) are balanced. In this case a minimum total risk is obtained, 
n 

It is noted that for a one-sided detection process, the form G(7r n ) in 
(4-94) to match the optimal decision boundary is simply a constant term. 
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The risk function of the detection system as a combined optimiza- 
tion problem will now be defined. Let the total risk function for a 
given a priori distribution tt associated with a specified admissible 
control sequence {u} and an admissible decision sequence { §} be 
defined as: 


T u-1 

Q u (tt) = E { W (* n ) + K u (*. )> 


(4-100) 


Here r u is the first time (a random number) that the process reaches 

the stopping boundary ir defined by the sequence { s} . If an optimal 

T u 

decision sequence is applied, then the total risk can be reduced to 
Q^(tt ) with the optimal stopping time and boundary denoted as t and 
7r^ respectively. In the specific problem considered, the risk W{?r n ) 
over each step is a constant c; the explicit representation W(7 r ) as a 
function tt is to emphasize the fact that the effect of each step on the 
total risk depends on the step risk c as well as on the state ir n affect- 
ed by the control variable u . The terminal risk K (ir r ) represents 
a measure of the specified false alarm criterion. Since the state 
no longer represents the probability that the system is in the degraded 
mode due to action of the control sequence {u }, it can not be justi- 
fied to assume K (?r T ) in the controlled process to be a linear form in 
7r T as in Wald's sequential analysis. But it is reasonable to assume 


u 


thatK^Xj. ) is a monotomcally decreasing function of the argument 


u 


The detection system can now be formally formulated as a com- 
bined optimal control and decision problem with associated risk func- 
tion defined as follows: 


Mm {G (tt) , Q (?r) } 

(u,5 } - U 

= Min { Min* [G(tt) , Q (tt ) ] } 
{u} {6} 

= Mm { Q^(tt) } = Q^(7t) 

{ u } 


(4-101) 


127 



For any specified control sequence {u} , an optimal risk function 
( 7 r) can be derived by the optimal decision sequence { 6*} . An 
optimal control sequence {u*} is to be derived over all admissible 
control sequences (u) such that 

Q°*(tt) 4 Q* (jr ) ^ Qj Or) 

4. 4. 4 The Suboptimal Control System 

In this section a suboptimal control scheme for the detection 
system is presented, and the suboptimal control sequence in the 
discrete-time case is derived. The difficulty of extension of results 
to the continuous-time system is discussed. 

In the previous section the detection system was formally for- 
mulated as a combined control and decision optimization problem. 
However, it is exceedingly difficult to obtain the optimal solution to 
the combined problem formulation. Unlike Wald's sequential analy- 
sis where no transition of modes is assumed to occur during the whole 
observation process, the detection system is characterized by a tran- 
sition from normal to degraded mode at a random instant. Moreover, 
it is difficult to formulate an explicit form of the terminal risk function 
m the controlled process due to the information distortion by the con- 
trol action. In this section a suboptimal control scheme is presented. 

The control philosophy of the suboptimal system is in a sense 
similar to the well-known open-loop- optimal feedback control (Dreyfus 

The operation can be described as follows. At the sampling 
instant k (k = 1, 2, . . . ), a current measurement is taken. By 
definition of the admissible control region, a control u^ is allowed if 
7!-^. e(0, 1/2), otherwise no control is applied. The control u^ is com- 
puted to minimize the total risk function to complete the detection 
process: 


Q( V u k > 


= E 

•T -1 

> c. + K (tt ) 

JT, 


L 1 U r 

k 


i=0 



(4-102) 


128 



if a degradation will occur right after the current measurement, and 
the system remains in the degraded mode thereafter. Here, the cost 
cn over each step is a constant c, and t is a random number. It is 
assumed that only the current control u^, but no future controls, is to 
be applied. This is justified by the definition of the admissible control 
region since the system is assumed to have degraded after the current 
time, and the process is reduced to an optimal decision process to 


terminate at an optimal stopping boundary. The current optimal con- 
trol u^. is implemented. The operation is advanced to the next sam- 
ple (k + 1). The procedure is repeated, and is implemented. 


The motivation of the design of the suboptimal system is to serve 
as a compromise between computation effort and an efficient control 
scheme. In the suboptimal system the problem of derivation of the 
optimal solution is reduced to a parameter optimization over the cur- 
rent control variable u k , since no future controls are to be applied. 
Moreover, the control is designed on the basis of a specified state ir 
In comparison the control in the real system must be designed at 
every sample instant on the basis of the observed sample state which 
is a random value. 


Since the control must be applied before the occurance of the deg- 
radation, andthe degradation will occur at an unknown instant, the best infor- 
mation one knows whether the system has not degraded is the actually observed 
state at the current time. To design the control under the assumption 
that degradation will occur right after the present stage represents 
the condition that if a degradation does in fact occur, the control will 
give the minimum mean time of delay to detect the degradation under 
a specified false alarm criterion. A precise definition of false alarm 
criterion will be discussed later m this section. The performance 
(4-102) which the control u^ minimizes is not the detection perfor- 
mance of the actual suboptimal system. However, the suboptimal 
system implemented with the control sequence {u^} derived at each 
sampling instant k to minimize a form of the risk function (4-102) 
should give a good detection performance. A quantitative comparison 
has to be derived from the explicit control law, since the latter deter- 
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mines the optimal boundary of detection for a specified false alarm 
criterion. The effectiveness of the suboptimal system will later be 
verified in Section 4. 5 by the analytical detection performance expres- 
sion, and confirmed in Section 4. 6 by the simulation results m com- 
parison with the performance of the non-linear optimal system. 

The optimal control u? in minimizing the risk function (4-102) 

(it is the suboptimal control u k for the detection system) must satisfy 
the basic recursion relation resulting from application of the princi- 
ple of optimality: 


Q* (*r k ) = Min E { [c + ?r k , u k f] | 7r k } (4-103) 

u k e U 


By definition the control is assumed to be executed at the sampling 
instant k in the form of an impulse with size u^. Since no control is 
applied during the sampling interval, the evaluation of the state it 
during the sampling interval A can be treated as the case without 
control. The state equations can then be represented as: 


*k+l = 7r k +u k + ^k 


A 



a 2 (k+l)A a (k+l)A 

^k = J *t (1 ' V? dt + “ J *t (1 " 7 r t ) dw t 

° kA ° kA 


(4-104) 

(4-105) 


The equation (4-105) is derived under the assumption that the system 
has degraded. For any stage i, i e [k+1, t-1], the optimal risk func- 
tion for continuation of observations must satisfy the recursion rela- 
tion in the optimal decision process: 


Q (irj = c + E { Q * (7r i+1 ) | 71^} 
! 

“ c + E ■{ Q* (tt^, + f x ) j 


(4-106) 
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where £ . is defined as in the equation (4-105). Substituting (4-104) 
and (4-106) into (4-103) produces the relation: 

Q*(r k ) = MmE{[o + Ef(at(» k + Uk + ^ k )| »+ (4 _ 107) 

k u 

= Mm Q * (ir, + u, ) 
u k eU K 

The solution is still difficult to obtain in the above formulation. 

Even the randomness of the degradation instant has been removed, 
the solution of the problem requires the explicit form of the non- 
linear terminal risk function which is difficult to define. However, 
the formulation provides a useful approach to the derivation of the 
optimal control. The optimal total risk Q r (7T, ) m (4-107) is obtain- 
ed by application of a sequence of an optimal control u/ and optimal 

. i, * Is. 

»!• f 

decisions (6,.,-,, . . . 6 # . ) to continue until the optimal terminal 
time r ‘ is reached. The risk Q ' (7r ^), though defined at the unknown 
terminal state 7r^*, is by definition composed of two parts: a termin- 
al risk function and an accumulated sampling cost function. It is 
certainly reasonable to assume the terminal risk K u (tt ) to be a de- 
creasing function of the state it even if its explicit form has not 
been formulated. Moreover, the portion of the sampling cost accu- 
mulated up to the current state 7r is of no information value for the 
detection system if the state is defined m the region R q = 

(0, 1/2)} . The analysis suggests that in this case it is advantageous 
to apply some control toward the boundary of the admissible control 
region. A useful information for derivation of the optimal control 

-v | 

is to prove monotonicity of the optimal risk function Q^(ttT) in its 

+ + K 
argument tt k , where 7T ^ = tt ^ + u^. 

The monotonicity property of the optimal risk is to be proved by 
application of the theory of the first passage time of diffusion process- 
es. This will be discussed in detail in Section 4. 5. 2 on derivation of 
the detection performance. Only a brief introduction will be presented 
here. As discussed m Chapter 3 and Section 4. 5, the state equation 
for Wald's sequential analysis can also be represented by the diffusion 
process X(t): 
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d\(t) = — (a - 4 a.) dt + — dw(t) (4-108) 

a 2 a 


^ |x(a) dt +■ bdw(t) 


where the parameter ju(a) is defined as: 


2 

i \ 1 a l 

fi( a l) - 2 

2 a 

for system defined m Hj 

2 

1 a l 

for system defined in H q 


Use of the diffusion process X(t) rather than 7r(t) as the state variable 
is motivated by the linearity of the X(t) equation (4-108). Let t ( A^, B^) 
be the mean time for the trajectory A(t) to exit through an upper bound- 
ary B u under the assumption that the observed process is in the degrad- 
ed mode Hp and has the initial condition A^. The boundary B^ is de- 
rived under a specified false alarm criterion. It is convenient to use 
the mean time T between two false alarms, rather than the false alarm 
error probability, as the error criterion in the first passage time ap- 
proach. It will be shown in Section 4. 5. 1 that there exists a determin- 
istic relation between the two criteria and hence they are mathemati- 

jt/ 

cally equivalent. It is clear from the definition of Q ' (rr , ) that the orig- 

inal problem to prove the monotonicity of Q "(Trp in the argument 7r ^ 

% 

can be reduced to an equivalent problem of proving the monotonicity 
of the meantime of delay t (A , B ) in detection in the argument A 
for a specified mean time T(A , B ) between false alarms. Here A^ 
is the reset state, equivalent to 7r^ in the n (t)-process, by a control 
/■ , i. e. , A u = A^ + £• , where A^ is the sampled value at k of the A(t)- 
process. 

The basic equation of interest for derivation of t(A , B ) will be 
discussed. Let two constants A = 0 and B u > 0 be selected as the 
lower and upper boundaries. The first passage time tty, a) for the tra- 
jectory A(t) reaching the upper boundary B u under the assumption that 
the initial condition A(0) = y / A ^ y < B u , and that the value of the 
parameter a does not change during the observation interval, is shown 
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in Section 4. 5. 2 to satisfy 


1 b 2 diT + |j,( a ) £ = -1 (4-109) 

2 d X dX 

The boundary conditions are derived by the definition of the control 
law. Let the parameter A be defined as follows. When the trajec- 
tory X(t) reaches the value A, a control is applied to reset the value 
to X . If a degradation of the system does in fact occur, then no 
future controls will be applied. The A(t)-process will eventually 
attain the upper boundary B , and the signal for detection of degra- 
dation is given. On the other hand, if the degradation of the system 
does not occur, a control of the same value is applied again to reset 
the value X , whenever the trajectory X(t) defined in the normal 
mode H q drifts back to the value A. It is assumed that the control 
is defined only as a function of the state, not of the time. The con- 
trol operation is repeated until the trajectory exits through the upper 
boundary to give a false alarm when the system is in fact defined 

in the normal mode H . Here one considers the same condition as 

o 

m the optimal detection system described in Section 4. 3, where the 
appearance of the degradation is preceded by a long process of obser- 
vation defined m the normal mode H , in which a stationary regime 
is established, to be interrupted by the occurrence of infrequent false 
alarms. In this case, it is meaningful to define the false alarm cri- 
terion in the established stationary regime of observation m the nor- 
mal mode H . 

o 

This description of the control action, m which a control is im- 
plemented whenever X(t) reaches a fixed value A, is used only for the 
computation of the mean time between false alarms. In the actual 
system X(t) is observed at periodic points t^ and control, if any, is 
implemented at those points. The values taken by the process at those 
times, A k , are random and this precludes calculation of the mean , 
time between false alarms under the action of the control intended to 
be used. The alternate control action described is a reasonable ap- 
proximation to the actual control action, and the approximation is 
especially good if the t^ are closely spaced so X(t) cannot drift far 
from X u between sample points. From the described control law, the 
boundary conditions are expressed as: 
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0 


(B ) = 
u 


t~(A) = t (\ u ) 


(4-110) 


The first passage time for the trajectory reaching the upper 
boundary B can be derived by solving the differential equation 
(4-109) with the boundary conditions (4-110). This can be shown 
to be: 


t (y, a) 


1 

P (a) 


B u -y+(X u -A) 


. , -hB 

/e~ h y - e u 

L' hX u -hA 

' P ~ P 


(4-111) 


under the assumptions of the initial condition A(0) = y, and of the 
parameter a. The parameter h is defined as: 


h - 2 M-(a) 
b 2 


2a - a, 


(4-112) 


The mean time t ( A^, B^) of delay in detection can be simply 


derived by use of (4-111) and (4-112) 
1 


TUu> v p( ai ) 


-A. -B 


B - A + (A 
u u 


- A) ( e - -) 

U V** - a- A ' 


(4-113) 


where the system is defined without controls and is in fact Wald's 
system defined in the degraded mode with initial condition y = A^. 
Since no lower boundary is defined m the system for detection of 
degradation and one notices that: 


lim 

A— 



\~ A \ 
'V - a ) 

- e 


0 


then equation (4-113) can be simplified to: 


(4-114) 
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1 


(4-114) 



B ) 
u 


p(a 1 ) 




The remaining point of interest is then to derive an expression between 
the upper boundary B^ and the value A^ for a specified time T between 
two false alarms through the relation T(A^, B ). The mean time is 
defined as T = E(T_,), the expectation of the time interval T_, between 

' Jc JL 1 

two false alarms under a stationary regime of observation in the nor- 
mal mode. The time T F is defined as the total time elapsed until the 
trajectory defined in the mode H q exits through the first time, and 
a false alarm is signaled. With this definition, the relation -KA^, B^) 
can be derived by an application of (4-111) as: 


T (V 


u 


B ) = 
u 


p(a o ) 


B + (A 
u u 


A) 


(7*-) I 


(4-115) 


where the parameters jit(a) and h are defined to be ii( a 0 ) i n (4-108) and 
-1. The initial condition is assumed to be zero, and the parameter A 
in this case is m fact the "generalized" sample value of the state A^. 
according to the defined control law. 


For a specified T, a relation between B and A can be obtained 

u u 

by derivation of an expression dB^/dA^. Using the condition dT/dA^ = 0, 
one obtains: 


®u 

dJ Si 


I- (e^ - e A ) 2 
f (A u - A) 


1 




) 



B 

(e u - 1) 


(4-116) 


The relation dB dA^ s 0 can be verified by use of the inequalities: 


x > 1 - e x (for x > 0) 

, x , 
x < e - 1 


(4-117) 
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One notices that (X = 0, B >0) 
u u 


K - A > 

<> - e A > 


-x. 


= e 


11 


<\ - A) 


1 - e 


~ A) 


u 


> 1 


(4-118) 


by identifying x = (X - A)= 0, and X^Si 0. The assumption X^>A is 
derived by the information provided by the optimal control problem 
formulation under equation (4-107). The relation (4-119) can also 
be established: 


X 

(X„ - A) e u 

1 


(e" u - e A ) 2 

<e X " - e A > 

(4- 

_/ 1 \ f 

U u- A > .. .,1 

■ > 0 

(a X « - Wl[ 

X - ■ A> ] 1 

X 

by use of the relation (e u 

- e A ) > 0, and the inequalities (4-117). 

Moreover, an upper bound of dB^/dX^ can be derived. Write 


Y = 


(X - A) e u 
u 

, X u Ax2 
(e - e ) 


A. A 

, u A, , 

(e - e ) 


Z = 


K - A) 

(a"? - e A > 


Then one can obtain 
dB 


u _ Y(e U - 1) 

dX u (Z e u - 1) 

B 

Y( e u - 1) 


Y 

< — 


Z(e u - 1) + (Z - 1) 


(4-120) 
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by use of the relation (4-118). It can further be shown that 

' Y < Z (4-121) 

For verification of (4-121)* it is sufficient to show 


(X. - A) 

^OC^A) < (X u " + 1 

■ (1 - e u ) 


u A 

since (e - e ) > 0. It suffices to show in turn that: 


(X. - A) e 

u 


l ( e u - & A ) 


u I 
— - 1 


>= / 




-A). 


Equation (4-122) can be simply proved by identifying 


x “ (K u - A) >0 


(4-122) 


and using the relation (4-117) 


x<e x - 1 


<x > 0) 

Therefore, the following bounds can be established: 


dB 


0 < 


u 


dk. 


< 1 


u 


(4-123) 


The monotonicity property of the mean, time t(X.^, B^) of delay 
in detection can now be derived. Let 


\ = A + L 


S, * A + A, 


(4-124) 


where t and t are two admissible controls defined by the admis- 
u * v J 

sible control region in the X(t)- state space. A simple transforma - 
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tion of the admissible region U in the 7r(t)- state space will give the 
corresponding region U x as U x = { X : X £ 0> . By (4-114), the 
following relation can be obtained- 


T <X U , B u , T) - 


^(aj) 


®u - V 


(4-125) 


t(X , B , T) = 

V V 


^(aj) 


(B - X ) 
v v 


(4-126) 


where B u and are derived for a specified mean time T. The use 
of the explicit argument T in the notation r is to emphasize its depen- 
dence on T. 

Let 


At = t (X y , B v , -T) - t (X u , B u , T) 


Then in order to prove t (X , B , T) to be a monotonic function in X , 

u u u 

it is equivalent to prove the condition At 2 0 for X y = X^. By use of 
(4-123), (4-124), and the condition ^(a^) > 0, one obtains 


At = 


p(a 1 ) 


ii(aj 


< \ - (B u - 


- V 


u 

f dB 
P dX 


dX> 


(4-127) 


< 0 


for X S 
v 


dB 


X^ since < 1. This proves that t(X^, B 


T) is a mono- 


tonically decreasing function in its argument X^. 


It is noticed that the above conclusion can be extended to the case 
for any value A(A= 0), since the monotonicity is proved without refer- 
ence to a particular value A. By identifying the parameter A as the 
generalized sample value of the state X^., the feedback control law of 
the X(t)- process for the suboptimal system in the discrete-time case 
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can be defined as: 


£ 


* 

k 


( V 



k k co 


k k >° 


(4-128) 


The suboptimal control ^" k , corrects the current state X^ to 
ary X = 0 of the admissible control region U^, whenever the 
negative, and no control is applied if 0. The equivalent 
of the jr(t)-process can be defined as 


the bound- 
state ^ is 
control law 



(“ "TO 
2 

if 7T k e (0, 1/2) 


u k (7r u> = ' 

j 


(4-129) 


L 0 

•J- 

if 7 r k e (1/2, 1) 



The suboptimal control u k corrects the current state ^ k to the bound- 
ary 7T = 1/2 of the admissible control region U whenever the state 
7r k e (0, 1/2), and no control is applied otherwise. 

The derived suboptimal control sequence {u^} = {u k , u k+ ^ . . . 
where each u k is derived according to the control law (4-128) or (4-129) 

'is to be implemented m the suboptimal detection system. It is noticed 

-i# 

that each control u? is defined under the assumption that the degrada- 
K + 

tion will occur at the instant k , while the real system will actually be 
degraded at an unknown instant. The detection performance of the sub- 
optimal system will be derived under the implementation control sequence 

si* 

{u^} by averaging over all possible degradation states. This will be dis- 
cussed in Section 4. 5. 3. 


The suboptimal control lav/ will give an intuitive explanation for 
the improvement of the detection performance. Consider the trajec- 
tory of the A(t)-process. The control is equivalent to resetting the 
process A(t) to zero whenever it drifts to a negative value. Thus, no 
accumulation of negative likelihood ratio will exist. This implies that 
£ transition from the normal mode to the degraded mode, if it happens, 
always occurs near the state ^ = 0 in the suboptimal detection system. 
The resets carried out by controls initialize the occurrence of degrada- 
tions in the detection process to an "equivalent" case with no transi- 
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tions. This gives an intuitive explanation of why the control sequence 
can provide an improved detection performance. However, the con- 
trol sequence, on the other hand, increases the false alarm errors of 
the system, if the same decision boundary for detection is held. A de- 
tailed discussion of detection performance will be presented, in Section 
4. 5.3. 

The suboptimal control sequence in the discrete-time case is re- 
alized only at the sampling instants. The logical step toward improv- 
ing the detection performance is to require that controls be carried 
out continuously. However, it is difficult to justify the extension of 
results in the discrete-time case to the continuous-time system. In 
the continuous system, a physically unrealizable continuous control 
u(t 7-^) of infinite size will be required in order to compensate instan- 
taneously the process ir of a diffusion nature as the sampling time 
interval approaches zero. Moreover, in the derivation of the sub- 
optimal control law in the discrete case, the state equation is used 
during the sampling interval as if no controls were applied. But no 
corresponding assumption can be justified m the continuous time case. 
The derivation of the state equation with continuous controls in a non- 
linear stochastic system, and the verification of the conditions on 
existence of optimal stochastic controls (the continuous control in this 
case does not satisfy the uniform Lipschitz condition) are difficult 
tasks to be resolved. 

The suboptimal continuous control problem will then be approxi- 
mated by a control sequence m the sense that there exist control func- 
tions resulting in a risk function Q e arbitrarily close to the risk Q 
in the continuous suboptimal control system. 

(u e ) - (u ) < e e > 0 (4-130) 

It is trivial to find the solution u e . It is the corresponding control m 
the discrete time case with the sampling interval A selected arbitrarily 
small, yet finite. The concept applied here is similar to the one ap- 
proximating a continuous optimal control by a so-called "e- optimal" 
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control. The existence of the e - optimal control has been discussed 

(73) 

in the reference (Stratonoviclr ). 

A block diagram of the discrete -time suboptimal detection system 
is illustrated in Fig. 4. 4. 

4. 4. 5 Identification of Polarity Information 

In this section the problem of polarity identification of the 
mean bias will be discussed. A method to detect the degradation 
mode of unknown polarity by an averaging process will first be re- 
viewed. It will be shown that the detection efficiency is attenuated 
by the averaging process. Moreover the test did not identify the 
polarity sign, which is useful information for the identification stage 
and for the efficient isolation of the degraded gyro as was discussed 
m Chapter 2. It will then be shown that the suboptimal system can 
be simply modified to detect the degradation with simultaneous identi- 
fication of unknown polarity of mean bias, with little additional com- 
putation. 

For testing the mean of unknown sign of a normal distribution 
with known variance a , Wald^^ proposed the following procedure. 

Let a positive value b be specified to define the degradation mode H^: 

10 - e 0 l £ b 


where 0q is the given mean under hypothesis H^. The probability 
density of the sample (x^, . . . , x^) with independent and identical 
distribution under Hq is given by 


P 

On 




b < Xl -e 0 )‘ 
1-1 


n 


<j 


(4-131) 


The density function p^ under is defined as some weighted average 
of the probability densities corresponding to various values of 0 m 
the definition of the degradation mode H^. It was shown by Wald that 
an optimum weighted average is the simple average of two density 
functions corresponding to 0 = 0^ ± b: 
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Fig. 4.4 Block Diagram of Sub-Optimal Detection System 
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(4-132) 
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n 

2 

1=1 


<v 9 



The test is then carried out as follows: Continue taking observations 
as long as A < (P^Pq^ < B. If P ln /P 0n = B, is accepted. If 
Pln^On = H q * S acce Pt ec *- Here A and B are the specified lower 
and upper boundary respectively. 


This is a reasonable approach if one has no more a priori infor- 
mation. However, the averaging process will significantly reduce 
the likelihood ratio sensitivity and thus attenuate the detection effi- 
ciency. This is illustrated in Fig. 4. 5. For simplicity of represen- 
tation 0 q is assumed to be zero, and the probability density function 

is illustrated in a single observation. Suppose an observation x. has 

P 1 (x 1 ) 

been taken, the likelihood ratio function — r of the averaging process 

p cr x r 

1 [ p- (x- | Ht) + p 1 (x 1 i H-.) ] 

will be attenuated by a ratio of — . — ± ± ± ± — ± ± — 

P 1 (x 1 I H+) 


in comparison with the case of the identified polarities. Moreover 
the computation of the likelihood ratio function will be complicated 
by the presence of the addition term in (4-132). 

It will be shown that m the suboptimal system a decision rule can 
be defined to detect the degradation mode with simultaneous identi- 
fication of polarity. The modified detection system is the suboptimal 
system but conditioned on both polarities. Because of the uncertainty 
feedback the suboptimal detection system is defined practically by a 
single- side boundary for detection for degradation mode only. With 
the system conditioned on both polarities two boundaries are defined 
with each one specifying a given polarity. Thus a decision rule can 
be defined for identification of the unknown polarity. For simplifica- 
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tion of presentation. Wald's sequential test process will be used for 
illustration. 

Let the normal distribution of the observations under consid- 
eration be defined as follows: 

Normal mode H q : N(0, a) (4-133) 

Degradation mode with positive polarity H^: N(b, a) 

Degradation mode with negative polarity H^: N(-b, cr) 
where b is a positive number, and a is the given standard deviation. 


Define the likelihood ratio function for degradation mode with 
positive polarity as 


m 


tn 


P<xJ H*) 
P(x i l Hq) 



(4-134) 


and for degradation mode with negative polarity as 
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(4-135) 


m. = 
l 


In 


P(x i ! Hj) 
I Hq) 



(x i + T } 


The test procedure for detecting mode is carried out as follows. 
Compute the likelihood ratio function of the joint probability density 


In 


Pnl'lHj) 4 

P n ('IH 0 ) ' 


n 

2 fa 
1=1 


n 




b] -A 


(4-136) 


= A • 

11 C7 2 


Observation is terminated with acceptance of mode H-^ if 

2 
rr 

B 


A n 


(4-137) 


and observation is terminated with acceptance of mode H Q if 


A <-£- 
n b 


A 


(4-138) 


Otherwise continue the observation. Here B (> 0) and A (< 0) 
are the specified upper and lower boundary respectively. The test 
procedure for detecting mode can be carried out similarly. Com- 
pute the likelihood ratio function: 


In 


P n <' 1 H l> A 
P„(' 1 Hq) 


n 

2 m 

i 


n 

- ( 2 x 
. 1=1 


l 2 ’ 
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(4-139) 

(equation cont'd) 




Accept the degradation mode if 

2 


M n < - 


• B 


and accept the normal mode Hq if 


M > - — * A 


Otherwise continue the observation process. 

Rewrite A and M in recursive forms.: 
n n 


A = A 1 + (x - 
n n-1 n 2 


(4-140) 


(4-141) 


(4-142) 


M = M - + (x + 4 ) (4-143) 

n n- i n & 

Now suppose the real system has been degraded with a mean bias of 

positive polarity, then the value A ^ on average must be positive. On 

the other hand, if A ^ 0, then there are two cases: either the sys- 

n 

tern is operated in the normal mode Hq or in the degraded mode H^. 

In the first case, one is not interested in verification of Hq, and in the 
second case A is not the correct decision function to apply. In both 
cases, one can apply the optimal uncertainty feedback control in the 
suboptimal system to reset A = 0. 

Now suppose the real system has been degraded with a mean bias 
of negative polarity, then the value must be negative. On the other 
hand, if M n § 0, then either the system is operated m the normal mode 
or in the degraded mode H^. In both cases one can apply the optimal 
uncertainty feedback control m the suboptimal system to reset M n to 
be zero. In summary, the following disjoint decision rules can be defined: 
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If A > 0, A is used as the decision function, for the detection 
n n 

system until it crosses the upper boundary. Then one accepts the 
■ 4 “ 

degraded mode and identifies the bias to be positive polarity. 

If M < 0, M is used as the decision function for the detection 
n n 

system until it crosses the lower boundary. One accepts the deg- 
raded mode and identifies the bias to be negative polarity. 

If A <0, {or M > 0), A (or M ) is reset to zero, 
n n n n 

4. 5 Performance of the Detection System 

4. 5.1 Choice of Performance Criteria 

In this section, the performance of the detection system 
is studied. It will be shown that for a sequential detection system 
with only a single- side boundary for degradation detection the mean- 
ingful performance criteria are the mean delay time in detection and 
the false alarm error probability. The definition of the false alarm 
error probability of the detection system is discussed, and a deter- 
ministic relation between the false alarm error probability and the 
mean time between two false alarms is mentioned. 

It has been discussed in sections 4. 2 and 4. 3 that the optimal 
decision problem consists of observing the posterior probability and 
comparing it against threshold (boundaries) to determine the deci- 
sion. The values of the thresholds are dependent upon the selection 
of specific performance criteria. The commonly used criteria for 
decision performance are the false alarm error probability a and 
the miss alarm error probability |3 . It is noted that the error prob- 
abilities a and j3 are defined only when appropriate decisions of Hq 
and have been made, such that the corresponding thresholds can 
be determined. However, for the case of the detection systems dis- 
cussed in section 4. 4, where no explicit decision on the normal mode 
will be made, then the threshold for decision on the normal mode is 
not determined, and only a single threshold for decision on the deg- 
radation mode is defined. It will be shown that m this case only 
the false alarm error probability n is well-defined, while the miss 
alarm error probability /3 will be shown to be zero. The result is 
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to be proved by application of the theory of first passage times for 
diffusion processes. (Darling and Siegert^ 14 h. 

Consider the homogeneous diffusion process A(t) with the tran- 
sition coefficient p and the diffusion coefficient b: 

dA(t) = g dt + b dw (4-144) 

with A(0) = Xq. It is required to derive the first passage time T of 
A(t) to the boundary y > A Q . The first passage time T is defined by 
the events: 

A(0) = \ Q , A(t) < y (0 < t < T), A(T) = T (4-145) 

It is convenient to study the first passage time by the use of an ab- 
sorbing barrier at y. Let the transition probability density be de- 
noted by 

pU 0 A t).= p (Xq, t Q ; A,t) (4-146) 

The density function p(A Q ; t) represents the transition probability 
density 'that A(0) - A^ and A(t) = A and that the process A(* ) does not 
reach the barrier y in the time interval (0,t). It is well known that 
the transition probability density satisfies the forward and backward 
Kolmogorov equations: 



{b 2 • p(A 0 At)} - {p ■ p(A QJ A, t)} 

8A 


ap(A Q ;A J t) 

at 


(4-147) 


and 


J _,2 

2 b 


BK 


{p(^ 0 A,t)}+p 


{ p( Aq, A, t)} = 


9p(\pM) 


at. 


at 



M) 


(4-148) 
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Let 

P(/V Q ,A,t) = Pr { A (t) < 7 , for 0 < t < t, A(t) 2 A |A(0)=A Q } 

A 

= / p(A Q , x;t)dx 

-« (4-149) 

P(A Q , 7 ;t) = Prob. (Tit) 

The distribution function P(Aq, A, t) also satisfies the backward equa- 
tion, as can be seen by integrating over the forward variable in the 
backward equation. 

The backward equation is the appropriate one to use for passage 
times since it is required to determine the passage time distribution 
as a function of the initial state Aq for a fixed terminal state y. Let 
g(t 1 A^y) be the probability density function of T, then 

g(t|A 0 , 7 ) = P(T = t) = — |^-P(A 0 , 7 ;t) (4-150) 

Substituting (4-150) into the backward equation for P<Xq, K t) produces: 
2 

~ b 1 2 A_ { g(t | A n ,7)} + M _L_ { g(t | A n ,7)} * -5- { g(t 1 3L , 7)} 

2 8 x 2 0 , sy . 8t * 

0 (4-151) 

The partial differential equation (4-151) can be solved by Laplace 
transformation. Define the transform of g(- ) as 

00 st ~ ' 

l(s|A n ,7)=; e -st g (t ] /U , 7 ) dt (4-152) 

0 _ .. u 

To ensure the existence of the limit of "g (s| Ag,y) the condition Re(s) 

S 0 must be satisfied. Equation (4-152) is transformed into the or- 
dinary differential equation: 

1 b 2 ii + K if- = S g - g(t 0 IX 0 ,7) (4-153) 

dA 0 

0 
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But the term g(t Q | 7 ) = 5(7 “ X Q ) is zero, if X Q < 7 . Then the gen- 
eral solution of which is: 


where 


where 


g ( X n ) = A (s) e 


X Q e i (s) 


+ B(s) e 


W s) 


(4-154) 


g ( X q) " S ( s l X 0 > 7) 


e,(8) . 0 o (s) = r» + 


' (4-155) 


e 1 (s) < o < e 2 (s) 

e 1 (o) = - /b 2 , e 2 (o) = o 

6 1 (0) = 0 , 0 2 (O) = 2 l/Lti /b 2 


(Re (s) > 0) 

(ju > 0) (4-156) 

(h < 0) 


To determine the coefficients A(s) and B(s), it is noted that due to 
boundedness g(X Q ) the coefficient A(s) must be zero, otherwise 
g (X Q ) ->■ oc, as Xq-^-oc (Re(s) g 0). Furthermore for X Q = y, then 
by definition, absorption occurs immediately and so g(-y) = 1 . This 
gives 


g ( x 0 > = g (s 1 * 0 ,7 ) 

( x Q - 7) • e 2 (s) 


( X 0=7) 


(4-157) 


Define 7 t(Xq, 7 ) as the probability of ever reaching y when start- 
ing from Xq < y, then ^(X Q , 7 ) can be simply evaluated from (4-157) 
by setting s=0. 


00 ~ 

. it ( X. 7 ) = / g (t I X ,7) dt 

0 0 (4-158) 

= 1 (0 IX Q , 7 ) 
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Substituting (4-156) and (4-157) into (4-158) produces: 


7T ( A Q , 7 ) = 1 

- 2 ( 7 - X Q ) lhi/b 2 
e ° 


(M § 0) 
(M < 0) 


(4-159) 


This proved the following statement. For the diffusion process A(t) 
with the transition coefficient p and the initial condition Aq, the prob- 
ability of ultimate absorption of the process A(t) m the boundary 
7 (7 > Aq) is unity when there is a drift toward the boundary y(/u > 0) 
or no drift (/ j = 0). When the drift fj. is away from the boundary (p < 0) 
there is a non- zero probability of reaching the boundary 7 . 

The above result can be immediately applied to the case of the 
sequential system with a single threshold for detecting the degrada- 
tion H^. For simplicity of presentation the linear system of Wald 
SPRT will be considered. In this case the detection procedure con- 
sists m observing the likelihood ratio function A(t) and making deci- 
sion if the process A(t) exits through the upper boundary 'V'. Con- 
sider the stochastic differential equation of A(t): 


d A (t) = - 1 — L dt + dy (t) 
2 a 2 a 2 

and the observation process iS‘ 
dy (t) = a^ dt + cr dw(t) 
dy(t) = a dw(t) 


A(0) = A q ( 4 -i60) 


(y (t) e H 1 ) 

(4-161) 

(y (t) e H q ) 


Substituting (4-161) into (4-160) yields: 


2 

cl cl 

dA(t) =-| |-dt + — — dw(t) (y(t) e H^) 

a cr 

(4-162) 

1 a i a l 

dA(t) = ~2~ 2 "dt + dw(t) (y(t) e Hq) 

cr cr 


with A(0) = A q 
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On identifying (4-162) with (4-144), it can be seen that fj. = 

1 a l 

~2 — 2 ~> 0 and Aq < y for the system defined m the degraded mode 
u 

H^. Thus for the sequential procedure where the detection time 
is not fixed, the probability of ultimate detection is unity, i. e. 

/3 = 0. However, if the real system is defined m the normal mode 
such that there exists a negative drift rate, there is a non-zero 
probability of reaching the upper threshold y 1° make the false 
decision on the degradation mode Moreover suppose the real 

system is defined as: 


dy(t) = a dt + a dw(t) 


(4-163) 


where a is an unknown parameter. 

Substituting (4-163) into (4-160) produces 

cl cL 

dX(t) = ■ ■■■ £ • (a - 4j- a^) dt + — — dw(t) 
a <j 


(4-164) 


A(0) 


Assuming that a^ > 0, the relation 

a 1 

V = ~2 (a "T a l ) = 0 (4-165) 

CT 

implies that a 

A simple application of the previous analysis will lead to the 
following statement: If the instrument is defined to be operating 
m the degradation mode on the condition that the real output data 
of the instrument show a mean bias magnitude greater than half 
of the "design" value a^, then the miss alarm error probability 
will be zero in the single side sequential detection system. Thus 
the meaningful performance criteria in this case are the mean delay 
time m detection and the false alarm error probability. 
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The false alarms are presentation of degradation signals when the 
boundary of detection for degradation was reached., while the opera- 
ting system is defined m the normal mode Hq. The false alarm error 
probability a of the detection system is defined as follows: 


a = Pr(y < 0) 

(4-166) 

= J P(v < t | H q ) * dP 0 (t) 


In the equation v is the time of decision that a degradation has occurred, 
while 0 is the instant of occurrence of the true degradation with the a 
priori distribution: 

P(0>t)=e" qt (4-167) 


Two more criteria of characterizing false alarms are of interest: 

N: the mathematical expectation of the number of false alarms be- 
fore the time of the occurrence of a true degradation. 

T: mean time between two false alarms. 

For the assumed distribution (4-167), a simple deterministic rela- 
tion N, T and a can be developed. 

By use of (4-166) and (4-167) the following relation can be established: 


(1- a) = P (v > e) 

= / P 0 <v § t)dP e (t) 

0 (4-168) 

QC 

= q / P n (i/S t) e" qt dt 
0 u 


where 1?{v= t 1 Hq) is denoted as Pq(i/= t). Expanding the exponential 
e q ^ produces 

(— ) = f* P n .(v§t) dt + 0 (q) (4-170) 

q 0 U 
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Using definition T in (4-173) and integration by parts one can show 


f P n (v ? t) dt 
0 u 



t d P Q (v < t) 


A 


T 


(4-170) 


Substituting (4-170) into (4-169) provides 

(— ) = T + 0 (q) 

q 

As q 0 with T fixed, i. e. , the mean time of occurrence of a true 
degradation goes to infinity, then it is reasonable to assume that a -*■ 1. 
In this case for any small e > 0 there exists a such that for q < q^ 
the following relation holds true: 

| (il£) - T | < e 

aq 

Thus for q < q-^ the relation can be established: 

(il^) ~ qT (4-171) 

a 


A deterministic relation between N and a can also be developed. 

The detection system is characterized by the multi-stage decision 

process. At each time an alarm signal occurs a verification is to be 

made by the identification system. The detection process terminates 

if a degradation is verified, and repeats if a false alarm is discovered. 

Because of the useful property of the exponential distribution: 

* 

p(e>s+ v i | e > p.) = p (e > s) 

t 

The conditional distribution of the process 9, given that a false alarm 
has been discovered at an instant y., coincides with the unconditional 
distribution, even if y i may be a random variable. Therefore until a 
true degradation has been verified, each detection stage can be regard- 
ed as identical and independent with the only difference being that one 
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more false alarm has been discovered. 

Let A be the event that i false alarms have been discovered before 
a true degradation has been verified. Then by definition 


N = S iPr {A} 

i=l 1 

cc i 

= 2 i a (1 -Of) 

l-l 

M * £ , (i— 1 ) 

= cul-Q') Si a 

1=1 

= 01 
1-n 


( 4 - 172 ) 


Since the case of most interest in practice is the one q -*■ 0, then the 
value of the error probability a of false alarms m detection system 
is usually incompatibly large in comparison with the specification of 
the false alarm error probability a in the Wald's sequential analysis 
process. In the Wald's case, the error probability a is always speci- 
fied to be less than 1/2, and is usually specified to be an order of 
1-5%, Thus for detection systems, the parameter T is usually the 
more meaningful criterion for the false alarm error specification. 

Since the parameters T, N and a are related by the deterministic 
relations, they are equivalent m mathematical formulation of the de- 
tection system. 

4. 5. 2 Derivation of Performance of Optimal and Suboptimal Systems 

Two approaches can be applied for derivation of the performance 
of the detection system. One is to formulate the detection system as 
a Bayesian problem and the performance is derived by solution of the 
Bayesian risk function (Stratonoviclr , and Shiryaev' ’). The other 
approach is to derive the performance by application of the theory of 
first passage times for diffusion processes. The latter approach will 
be used in this thesis, since it gives more physical insight and gives 
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a unique formulation for comparison of performance of the optimal 

and suboptimal system. The solution to the first passage problem for 

a diffusion process was given m an early paper by Darling and 
(14) 

Siegert . The result was extended to the solution of the perfor- 
mance (operating characteristics) in Wald's sequential analysis. 

The sequential analysis was originally formulated m the discrete 
form by Wald^^. The exact solution of performance is analytically 
difficult. Wald solved the problem by developing a fundamental iden- 
tity (Wald's identity) and by neglecting a possible excess of the like- 
lihood ratio function over the decision boundary at the final absorption 
point. Darling and Siegert formulated the problem by approximating 
the discrete process in a continuous stochastic process, and derived 
the asymptotic solution by application of the Kolmogorov equations of 
a diffusion process. Aside from the difficulty of justifying the limit- 
ing operations, it turns out that this procedure leads to simple solu- 
tions. In this section the general formulation for the derivation of the 
detection system performance will first be presented. The mam re- 
sults of the optimal system performance by Shiryaev will then be men- 
tioned. Finally, the derivation of the suboptimal system performance 
will be developed. 

The problem of solution of detection performance is to derive the 
mean time delay in detecting the degradation under the constraint of a 
specified mean time between two false alarms. The problem is to be 
formulated as follows. Suppose the observation process y(t) defined in 
the normal mode is observed. With each sample function of the ran- 
dom process y(t), a random stopping time v of detecting a "degradation" 
is observed. Associated with a whole ensemblance of observations, 
one can define the distribution F(u) of the stopping times v. 

Now consider the multi-stage observation method for the detec- 
tion system defined in the normal mode Hq. The first stage of obser- 
vation is defined in the usual manner. If after the first stopping time 
v ~ v -y, the observation is resumed according to the same rules as in 
the first stage of observation, and independently of results of the first 
stage, then the stopping time does not depend on v ^ and has the same 
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distribution F(u). A similar consideration applies to the duration v ^ 
of subsequent stages. 

Let 


= max { k: + . . . < t} 


P t (u) = P{ [t - (v 1 + . . . v k )] S u } 
cc 

T = E n (v) = J u dF (u) 
u 0 v 


(4-173) 


where E Q (* ) means the expectation taken with respect to the observa- 
tion process defined in the normal mode Hq. It is assumed that the 
appearance of a degradation is preceded by a long period of observa- 
tions of the system in the normal mode, then in the case T<cc, a sta- 
tionary process of observations in the normal mode can be assumed to 
be established m the sense that 

1 - P(u) = lim P { [ t-(v- + . . . v ) ] > u} 
t-oe K t 

(4-174) 

= J (1-F y (x))dx 

The equation (4-174) was proved in an early paper on renewal theory 
(19) 

by Doob , in which P(* ) is defined as the distribution function of the 
initial age, and F<* ) the distribution function of lifetime. 

Let 

(u) = E(v - u 1 v £ u) (4-175) 

where the conditional expectation is calculated under the assumption 
that the observation began at t=0, and is performed according to the 
decision rule 6, that a degradation appears at time t=u, and that v = u 
is the first stopping time. Then the quantity 
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(4-176) 


6 

T t 


t 

(T) = I 


0 


E 6 (u) d P t 5 (u) 


characterizes the mean time delay in detecting a degradation which 

appears at the time 0=t, if the decision rule 5 is used with a specified 

mean time T defined in (4-173). It is noted that the distribution pf(u) 

t 

is calculated under the assumption of the system in the normal mode, 
and (u) is conditioned on the assumption that a degradation has ap- 
peared. 

Define the quantity (t -* oc) 


t 6 (T) = 



(4-177) 


as the mean delay time in detecting the degradation appearing against 
the background of an established stationary process under the normal 
mode. The problem is then to find the performance of the optimal 
method for which 


t(T) = mfr 6 (T) (4-178) 

6 

for given T. 

For clarity of presentation, the basic steps in performance der- 
ivation will be first discussed. It has already been shown that the op- 
timal detection system consists of observing the posterior measure 
7r(* ) (or X(- )) and evaluating it against the threshold for detection. The 
diffusion process M ‘ ) will be used in performance derivation because 
of its simpler expression. Let Aq(s) and A.^(s) be the notation of the ran- 
dom process A(s) conditioned on the system defined m mode H Q and 
respectively. The equation (4-177) will be explicitly written as: 

t(T) = f t(y, T)p(y)dy 

•'0 (4-179) 

where p(y) is the one- dimensional stationary density function of the 
probability distribution established by the process Aq(s). The quantity 
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t(y, T) is the mathematical expectation of the first passage time at 
which the process A^(s ) exits through the set decision boundary, 
under the assumption that A^(0) = y at the time of appearance of the 
degradation with a specified mean time T between false alarms. 

Since only the time delay after degradation is of interest, the degrada- 
tion is assumed to occur at the instant s = 0 . 

Let the homogeneous diffusion process Aq(s) be defined as 

dA Q (s) = ds + ct o (A 0^ s ^ dw(s > (4-180) 

/ 

It is noted that Mg(* ) aa d °q{‘ ) are not explicit functions of the time 
variable, as it has been shown for the process X(* ) m the untruncated 
case in section 4. 2. Then the transition density function p^ (Ag,y, 
satisfies the Kolomogorov equations associated with the diffusion pro- 
cess /' Q (s). Since one is interested m the probability density of the 
forward variable y in (4-179), then it is appropriate to apply the for- 
ward equation. If the threshold for detection is denoted as 7 , then it 
is equivalent to have an absorbing barrier at y = 7 . Due to the defini- 
tion of the transition probability of the forward variable, the appropri- 
ate boundary condition at 7 is. 

p. ( 7 J 7 , s) = 0 for s>0 (4-181) 

0 

In the stationary case, the transition density function p(Xq; 7 , s) 
will approach the stationary density p(y) with the appropriate bound- 
ary condition p( 7 ) = 0. This gives the approach for derivation of the 
density function p(y). 

, The distribution of the first passage time for the random process 
A^(s) is a rather complicated expression. However, if the first pas- 
sage time is a proper random variable, that is, absorption (detection) 
is a certain event, then the moments of the passage time can be shown 
to be the solutions to simple differential equations by the following 

(14) 

theorem for the homogeneous diffusion process (Darling and Siegert' 

Let A^(s) satisfy the homogeneous diffusion process: 

dA 1 (s) = q 1 (A 1 (s))ds + a^A^s)) dw(s) (4-182) 
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with ,\^ (0) - y, E > y > A, and A and B are the boundaries. Here ju^(' ) 
and cr 1 (* ) are also not of explicit functions of time. Define the first pas- 
sage time T AB (y) {crossing either the boundary A or B) as the random 
variable: 


t ab (y) = SU P ft I A < A 1 (s) < B, 0 5 s g t} (4-183) 


Then if T AB (y) is a proper random variable whose moments of order 

n i n Q exist, t ^ = t^g(y)* the nth moment of T AB (y), satisfies the 
system: 


1 

2 


2, \ 

^(y) 


d 2 t^ 



+ p L (y) 


dt {n) 

dy 


-n 


^n-1) 



(4-184) 


t (n) (A) = t (n) (B) = 0 n > 0 

AB AB 


The theorem can be proved by substituting a series expansion for the 
moment generating function into the backward differential equation, and 
equaling the corresponding coefficients to zero. Since one is interested 
in the first passage time distribution to a fixed state of absorption as a 
function of the initial position A^(0) = y, the backward equation provides 
the appropriate method. In particular if one is interested in the case 
of mean time for which n-1; then (4-184) is reduced to 


1 

2 


o\ 


(y) 



+M 1 (y) 


dt 

dy 


-l 


t (A) = t (B) = 0 (4-185) 

The function ?(y, T) in (4-179) can be thus found as the solution to the 
equation (4-185). 

With the general approach as the background, the performance 
of the detection system will be derived. 
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(1) Performance of the Optimal System 


The performance of the optimal system has been derived 
by Shiryaev^ 64 ^. Here only the main results will be mentioned for 
reference, the details are left to the original paper. 

It has been shown m Chapter 3 and section 4. 3 that the optimal 
method for detecting the degradation is based on observing the sto- 
chastic equation: 


2 

dX(t) = [q(e" X{t) + 1) |-]dt +-|-dy(t) (4-186) 

a cr 

with the initial condition X defined as 

>-=^n( T ? F ) (4-187) 

where tt is the a priori probability that the system has been degrad- ' 
ed. In Shiryaev's derivation it is assumed that ir -+ 0, then X -*■ -t». 
The optimal boundary y for detection can be derived from (4-47) in 
section 4.3, as 


y = in LlZlSl'l l\ = |n(il£.) (4-188) 

1 Qf(l-TT) f ° 


but it has been proved in (4-171) 


= qT (4-189) 


Then T = ^(qT) (4-190) 

Define 


p(t) - y - X(t) 


2 

A 1 a l 

r ~~2~2~ (4-191a) 

cr 
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(4-191b) 



(4-191c) 


The equation (4-186) can be rewritten as: 

dp (t) = r(l - g - ce p (t) )dt \ dy(t) (4-192) 

u 

with the initial condition, 

p (0) = oc 

and a degradation is signaled at the boundary y and p (y(t)) = 0. 

The stochastic differential equation pQ(t) for the system defined 
in the normal mode Hq can be written as the diffusion process: 

Pg(t) a i 

dp Q (t) = r(l - g - ce )dt ^-dw(t) (4-193) 


since the observed process in the normal mode can be written as 

dy(t) = cr dw(t) 

The above equation can be further simplified by the transformation to 
a natural dimensionless scale s for time t: 

s = rt (4-194) 


dp Q (s) = (l-g-ce° ) ds - VYdw(s) 


(4-195) 


Pn(0) 


and a false degradation will be detected at the boundary y with p Q (y(s)) 
= 0. The density p(y) can now be found by the forward equation: 
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JLg. { 2 p(y)} - -§- { (1 - g - C e y ) • p(y)} = 

9y - (4-196) 

The equation, for the stationary density function p(y) can be sim- 
plified to an ordinary differential equation with a variable coefficient: 


— - — 1. { (1 - g- ce y ) p(y)} = 0 (4-197) 

dy 2 ^ 

The unknown constants of the differential equation are removed by 
the condition (4-181) at the absorption boundary 7=0, 

p(y) = p(0) = 0 

and the normalizing condition, 

cc 

J P(y) dy = 1 
0 


The probability density p(y) can be solved as 


p(y) 


f o (y) , y _f o (x) . 

e J e dx 

0 

oc fg(y) y -f 0 (x) 

/ e U [ / e dx] dy 

0 0 


(4-198) 


z 

where f n (y) = J (1 - g - ce ) dz (4-199) 

u 0 

~ A ~ 

To derive the function t(y, T) = t(y), the stochastic differential 
equation for the system defined in the degradation mode must be 
used, the equation p^(s) can be easily derived from (4-192) to be 

Pi(s) 

d p^s) = (-1 - g - ce 1 ) ds - v 2 dw(s) (4-200) 

by using the relation 
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dy(t) = a^dt + c dw{t) 


and by the time scale transformation (4-194). 

The differential equation for t(y) associated with the diffusion 
process (4-200) can be expressed as 

(- 1 - g- c e y> -1 (4-201) 

dy dy 

with the boundary conditions, 

t(A) = t(B)= 0 (4-202) 


where A = 0, and B -*• oc. 

Solving (4-201) produces 


y oc f (x) -f i (z) 

t(y) = / [ / e 1 dx] e * dz 

0 z 


where 


(4-203) 


Mx) = / (- 1 - g - c e z )dz 

0 

= - (1 + g) x - ce x + c 


(4-204) 


Substituting (4-198) and (4-203) into (4-179) the detection performance 
in the time delay for the case g = 0 can be explicitly obtained by some 
manipulation: 


t(c) = 


OC . t z t 

e -t ( / — 2 dz) {/ e 
c c z c 


oc -u 

dz ( / — 2 ~ du) } dt 
z u 


oc , t u (4-205) 

J e _t (/ -^5-du) dt 
c c u 
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A further simplification has been worked out in details in the refer- 
ence. The final result can be shown to be: 


t(c) - e c (-Ei(-c)) - 1 + c / e cv l n (l + v) j v (4-206) 

0 v 

where the exponential integral is defined: 


oo -u 

-Ei(-c) = / — du c>0 (4-207) 


For the particular case of interest of sufficiently small values 
of c, i. e. T is large, (4-207) can be expanded into the asymptotic 
series: 


-Ei(-c) = - 7 - in 


00 

+ 2 (- 1 ) 
n=l 


n+1 


n 


n: n 


(4-208) 


where 7=0. 57721 is Euler's constant. 

Using (4-208), the asymptotic solution can be simplified to be: 

t (c) = -£nc-l-y + 0(c) (4-209) 

This is the optimal performance of mean delay time r (c) in detecting 
the degradation as a function of the parameter c. Substituting (4-191) 
into (4-209) and transforming back from the natural scale to real time 
scale, t (T) can be expressed as 

2 2 

r(T) = { in [-—• T] - 1.5772 + 0(-?§ )} (4-210) 

a^ 2a a^ T 

where T is the specified mean time between false alarms in the real 
time scale. 

(2) Performance of the Suboptimal System 

The derivation of the performance of the suboptimal sys- 
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tem is approached by solving the first passage time of an equivalent 
linear detection system with some modification of the boundary con- 
dition and application of a limiting operation. Consider the stochas- 
tic differential equation of Wald's linear detection system: 

a l 1 

dA(t) = — y [ dy(t) dt ] (4-211) 

CT 

The observed process is defined as 

dy(t) = a dt + a dw(t) (4-212) 

where a is the true unknown parameter defining the operation mode 
of the system, and the parameter a is specified as a^ for mode 
and as zero for mode Hq. 

Substituting (4-212) into (4-211) gives 

ci a 

dX(t) - — g- (a --g-a^)dt 4 — — dw(t) 
o 

= p(a) dt 4 - b dw(t) (4-213) 

For Wald sequential analysis procedure, the decision rule is de- 
fined as follows: Two constants Ago and B = 0 are selected by the 
set of error probabilities. The decision rule is of single stage, the 
detection system terminates whenever A(t) reaches the lower boundary 
A or the upper boundary B. The first passage time for Wald's system, 
under the assumption that A{0) = y, and that the value of the parameter 
a does not change during the whole observation interval, can be ob- 
tained by solving the following differential equation: 

1 .2 d 1 2 t . . dt _ _i 

~2 b ~Y + M(a) "dy " (4-214) 

dy J 

with the boundary conditions t(A) = t(B) = 0. 

Notice that with assigned boundary conditions, there is no guarantee 
that absorption m the boundary B occurs before absorption in the bound- 
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ary A. To guarantee the absorption in the boundary B, the decision 
rule is modified as discussed in Section 4. 4. 1. when the process A(t) 
attains the value A, the process is reset to A=0 and the observation 
is continued. When the process attains the value B the alarm signal 
for the detection of disorder is given. If the alarm is verified to be 
false, the process is renewed again from the point A=0, and so on 
until disorder is detected. 

From the described decision rule, the correct boundary condi- 
tions are expressed as 


t(B) = 0 
t(A) = t(0) 


(4-215) 


The first passage time for the trajectories reaching the upper 
boundary B for detecting a degradation can be derived by solving the 
differential equation (4-214) with the boundary conditions (4-215). 
This can be shown to be 


t (y, a) 


h( a) 


(B - y + A 


e- Bh -e' yh 


1 - e 


-Ah 


(4-216) 


under the assumptions of the initial condition A(0) = y and of the para- 
meter a. The parameter h is defined as 



2a - a i 


a 


1 


(4-217) 


The mean time T between two false alarms is defined as the mean 
time for the trajectories A(t) to exit through the upper boundary B under 
the assumptions that the observed process is of normal mode (a=0), and 
the initial condition A(0) = 0. With these definitions, the meantime T 
can be derived from 1(4-216): 

B 

T = t(0,0)=^y • (B + A ) (4-218) 
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The detection performance of the suboptimal system can be derived 
as in (4-179) by: 


B „ 

r(a t , T) = J t (y, a 1 , T) p(y) dy (4-219) 

. 4 

Here p(y) is the probability density of the process A(t) = y at the in- 
stant of degradation. As in the optimal case, it is assumed that for 
this instant there exists a stationary condition, established in the ob- 
servation process in the normal mode. The density p(y) is to be 
solved by the Kolmogorov forward equation. The final result can be 
shown to be: 


p(y) = 


(l-e B )(e A -y-l) 

A(l-e B ) + B(e A -l) 

(e A -l)(l-e B ~y) 
A(l-e B ) + B( e A - 1 ) 


A = y < 0 


0 <y IB 


(4-220) 


The derivation is based on an application of the method of Green's 
function and will be presented m Appendix B. 

The performance of the suboptimal system can now be derived 
by a simple limiting operation of the above general formulation. In 
the suboptimal system the linear stochastic system is designed with 
a feedback of uncertainty information defined by X(t). In the discrete- 
time system, the sampled value at each sampling instant is actually 
a random number, which is reset by the control to a value zero. It 
is difficult to derive the detecxion performance when the controls are 
applied at boundaries of random values. Instead, the performance is 
derived under the assumption that the controls are applied at a con- 
stant boundary A. The co'nstant value J A may'be regarded as the ex- 
pected value of the drift over one sample interval of the A(t)-tra;jectory 
defined in the normal mode. The assumption of a constant value A 
can be further justified if one considers the case where the implement- 
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ed controls are u e - controls defined in Section 4. 4. 4, and are executed 
over very small sample intervals in approximating the continuous sub- 
optimal control law. In the limiting case of the continuous-time system, 
the control will almost instantaneously reset A(t) = 0 whenever A(t) < 0. 
This is equivalent to observing the linear system with the lower bound- 
ary A set to zero. The detection performance is derived on the basis 
of this limiting case. In this case one obtains from (4-218), 


T = lim — — 

Ah>0 h(0) 


(B + A 



= — - ' (B - e B + 1) 

H.C0) 


(4-221) 


Let the upper boundary under the case A = 0 be denoted as B^ . 
The equation (4-216) can be rewritten as 


?(y,a,T)«L_ . [ B ,-y + (4-222) 

M(a) h 

Define x = B^-y, the following relations can be established by defini- 
tions: 


2 2 

9- cl 

h(a x ) = 1, m( 0) M( ai ) - ^ 2 " 

2n 2a 


(4-223) 


.2 B„ 


T = (e^ 1 - B- - 1) 


(4-224) 


t(x, a x ,T) = 


2a 


_ g 

[x+e ^l-e*)] 


(4-225) 
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p(x) 


<e A -l) (l -e B -y> 


* lira 
A-0 
B'*B l 


A(l-e B ) + B(e A -l) 


0 g y < b 


B,-y 

(l-e 1 ) 

Si 

l-e +B, 


(e x -l) 

B 1 

e 1 - B x - 1 


OS y S B 1 

0 = x - B 1 (4-226) 


The detection performance T(t) can be derived by the simple integra- 
tion of (4-219): 


r(t) = t (a^ T) = 


/ fca^T) 

0 


* p(x) dx 








- 2 


+ e 



)> 


(4-227) 


A simple transformation of (4-292) will give an interesting com- 
parison between the detection performance of the optimal and subopti- 
mal systems. 


Consider the case that T is large which will be of main interest, 
an approximate value for the upper boundary B^ can be derived 


T = 
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or 




T) 


(4-228) 


The detection performance (4-227) of the suboptimal system under 
this condition takes then the form: 

, t(T) = 


The detection performance of the suboptimal system has thus been 
shown to be of a remarkably good approximation to that of the optimal 
system as expressed in (4-210): 

o 2 a. 

t(T) = iiL { £n (— L . T) - 1. 5772 + 0 (— Z_) } 

2 2 2 „ 

2cr a j T 

An analysis of this result will be discussed in the next section. 

4. 5. 3 Discussion of Detection Performance 

In this section the performance comparison of the optimal and 
suboptimal detection system will be analyzed. In Section 4. 4. 4 the 
improvement of performance in the suboptimal detection system has 
been intuitively explained as a result of control actions that reset the 
system state to the null- discrimination state whenever the system 
is operated m the normal mode. A more detailed analysis will be 
presented in the following discussion. 

In deriving the performance of the detection system the problem 
was formulated as a first- passage time of the diffusion process in both 
optimal and suboptimal systems. This gives a better insight for per- 
formance comparison. Consider the basic equation of the mean time 
delay (4-219): 


A~) * (B. 


~) 

2 


(4-229) 


(2o__) [ lat-ig-T) -•§•] 

a H 2a 
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(4-230) 


r B ~ 

t(T) = / t(a., y, T) p(y) dy 

A 

There are two fundamental factors which enter into the basic equa- 
tion: the behavior of the stochastic differential equation A(t) and the 
boundary values A and B. Each factor will be analyzed in both opti- 
mal and suboptimal systems. In the following the system stochastic 
equations will be expressed in the dimensionless natural scale s. The 
following transformation has been defined in (4-191): 



a 


g = -l (4-231) 

r 

s = rt 

A. The St ochastic Differential Equation 

In the optimal system the stochastic differential equation 
can be expressed as 

* * * - 

dA(s) = (- 1+ g + g e _ ^ S ^)ds + (~)ds+/2 dw(s) 

1 . , (4-232) 

Let the a priori probability of degradation be denoted by ir , then the 
initial condition of A(s) is defined as 

The behavior of the process A(s) will be studied. It is assumed that 
g is negligible, i. e. , the instrument has a long mean life time. How- 
ever the term g e ^ S ^cannot be neglected, since A(s) can be a large 
negative number. The equation (4-232) can be rewritten as 

d A(s) = (g e" X(s) -l)ds + (l^)ds +VT dw(s) (4-232) 

a. 
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Denote X g = fn g. Consider the term in the brace of (4-233) around 
X(s) = X g . There are three cases: 

(1 ) X(s) = X , then the term 

6 

(ge" X(s) - 1) = (ge Xe - 1) = (g - 1) = 0 

& 

tends to zero, i. e. , this term contributes nothing at the point X(s) = X . 

(2) X(s) < X , let X(s)=X - 5 X, §X > 0. Then the term 

0 s 

<ge~ X(s) -l) = <e 6X - 1) > 0 

contributes on the average a positive trend for X(s) through the transi- 
tion coefficient, and gives a compensation effect. 

(3) X(s) > X , let X(s) = X + 6X, 6X >0, the term 

0 0 

<ge" A(s) - 1) = (e' 6X - 1) < 0 

contributes a negative trend. In particular, if 5 X is large, then the 
term e 6 ''’tends to zero. This will be the situation when the system 
is operated in the degradation mode. 

The fact that the process X(s) oscillates about the state X g = i*n g 
provides a convenient and interesting specification for the initial condi- 
tion X. In general for any practical system the initial state of degrada- 
tion 7T should be a small value, this means that the initial state X tends 
to be a negative infinite magnitude. From the above analysis in case 

(2), the trajectory X(s) will converge rapidly toward the value X through 

-X ® 

the contribution of the large value of the term (g e -1). For the case 
of practical interest a degradation will occur only after a long time of 
normal operation, then in evaluating the detection performance it is 
unnecessary (and usually difficult) to get accurate information about 
the initial state. A convenient and reasonable specification for the ini- 
tial state can be assigned to be 

X = fn g (4-234) 

Next consider the corresponding stochastic equation of the suboptimal 
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system: 


2 

3 . 3 . 

dMty (fit) - 1 -l_)dt + -1 dy(t) (4-235) 

C* Li Lk 

a a 

Transforming (4-235) into the natural scale by (4-231) produces 


dX(s) = (f( s) -l)ds + (— -) ds + yf% d w (sj 

a. 


(4-236) 


It has been shown in section 4. 4. 4 that the suboptimal control 
f{ s) resets Ms) to zero if Ms) § 0, and no control will be applied 
if Ms) > 0. The term <^(s) -1) really contributes around the point 
\(s) = 0 a similar effect as the term (q e l) on the behavior of 

the stochastic equation. If the system is operated in the normal 
mode, Ms) tends to be negative. The suboptimal resets the state 
Ms) to zero. This case is analogous to the condition (2) in the opti- 
mal system. If the system is operated in the degradation mode, A(s) 
tends to be positive, and no control is applied. This is analogous to 
the condition (3) in the optimal system where 6 A is large. This anal- 
ysis of the trajectories of the stochastic differential equation is clearly 
confirmed by simulation results as illustrated in Fig. 4. 6 and Fig. 4.1 0. 

B. The Boundary Values A and B 

The boundary values [ A, 7] of the optimal detection sys- 
tem have been derived in (4-234) and (4-190). 

A*fn(g)= fn (~) 

y= Inf 1 - fUbl?] 

L cx(I--k) J 

~ fn (l^L) r fn(qT)' (4-237) 

a 

= fn (S) + fn(rT) 
r 
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The corresponding boundary values [A, B^J of the suboptimal sys- 
tem have also been derived in (4-228). 

A = 0 

2 

B x = in [— i^-T] * fo[r T] (4-238) 

2a 

For a specified T, the boundary values of both systems differ 
only by a translation by fn(q/r). 

A comparison of the system detection performance of the subopti- 
mal system with the optimal system can now be analyzed as follows. 
Consider the basic equation (4-230) of the detection performance for 
the optimal system (denoted by subscript "0"): 


tqU) = / ^ 0 (y,a 1 ,T)p 0 (y) dy- 
X 


y-X 

= f p Q (z) t 0 (z, T) dz 


y e [ X, 7] 


(4-239) 

z e [ 0, y-X] 


where the transformed boundary can be derived from (4-237)* 

2 

a 1 

(7- X) ^ fn [ tt • T] 

2a 

The detection performance of the suboptimal system (denoted by 
subscript "s' 1 ) can be similarly expressed as* 


t s ( t)= T)p s (y)dy y € [0, BjJ (4 ■ 240, 

2 

~ n a 1 

where B-, = In [ — y * T] 

2a 

The time delay in detecting degradation t(y, a^, T) is evaluated 
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on the conditions that the system is operated in the degraded mode 
that the initial state is y, and that the mean time between false 
alarms is specified to be T. The stochastic equation A(s) for the 
optimal system defined in mode can be expressed as 

dA(s) = [g(l + e~^ s) ) + 1 ] ds WTdw(s) A(0) = X (4-241) 

Define Xs) = A(s) - X 

Then (4** 2 41) is transformed into 

r+f 

dX(s) = (1 + g + e ^ S ^) ds +V2~dw(s) X(0) = 0 (4-242) 

2 

~ a-, 

with Xs) defined m the interval [0, fn( s- * T)1 

2<r 

The stochastic equation X(s) for the suboptimal system can be shown 
to be 


dX(s) = [/"(s) + 1] ds +^~dw(s) 


(4-243) 


Since the process X(s) can be assumed on the average to be posi- 
tive under the condition of the degraded mode then t(s) = 0, and 

(4-243) can be rewritten as 


dX(s) = ds +V2~ dw(s) 


X(0) = 0 
2 


(4-244) 


1 

with A(s) defined m the interval [ 0, fn ( — ^ * Ti]. A larger transition 

2<j 

coefficient is present in the stochastic equation (4-242) for the optimal 

system. Hence by application of the comparison theorem the optimal 

system will m general show a shorter time delay to exit through the 
2 

boundary fn(_l__* T) as expressed by the relation 
2(t 2 


^o^ a i» T )= 


(4-245) 


However the difference will be insignificant if the parameter g is 

-'Jifs) 

assumed to be small. The term e ' will initially contribute a large 
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effect, but will show a minor effect when the system is operated in 
the degraded mode with a large value of A(s). A detailed comparison 
of the probability density functions Pg(y) and p g <y) would be quite 
involved, but there is evidence to believe that they will be approxi- 
mately of the same distribution from the results of the analysis of 
the behavior of the stochastic equations of the optimal and subopti- 
mal systems conditioned on the normal mode Hq. Finally, the 
boundaries of both systems have been shown to be the same, there- 
fore the result that the detection performance of the suboptimal sys- 
tem shows a remarkably good approximation to that of the optimal 
system is not surprising. 

Different detection systems of interest have been discussed in 
the literature. They include the detection system of the fixed-size 

/ £* C \ 

Neyman- Pearson method (Shiryaev^ ') and a sequential test devel- 
oped for use by radars detecting an emerging target (Gagliandi and 
Reed^^). These however will not be reviewed here. In the following 
a detection system with the so-called degenerated Wald's sequential 

/ O C \ 

analysis method (Shiryaev ; ) will be briefly discussed. The sys- 
tem has been mentioned before in Section 4. 4. 1. When applied to 
the given problem, the method consists of observing the process A(t), 
A ( 0) = 0, and making a decision at the instants when the trajectory 
-X(t) first exits one of the boundaries. If \(v) = B > 0 where B is an 
"upper'' bound, then one accepts the hypothesis that a degradation is 
present and one verifies this decision. If a degradation has been 
verified, the observation terminates. In the case when it is a false 
alarm, and also for Mv) ~ A § 0, A being the "lower" boundary, the 
observation process restarts with Preset to zero. 

Let T and t be the mean time between two false alarms and the 
corresponding mean time delay in detecting a degradation. Then it 
can be shown that both T and t are functions of the boundaries A and 
B by application of the theory of first passage time problem. For a 
given T the solution for the optimal detection, system is reduced to 
finding an optimal parameter A to minimize t (A, T). Shiryaev has 
managed to prove by a rather cumbersome procedure the following 
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statement: there exists such a T < oc such that for all T = T the 
optimal choice of the lower boundary A is A = 0 in the class of 
Wald's sequential analysis (i. e. with the constraint A g 0, B>0). 

Two remarks have been emphasized by Shiryaev in his approach. 
First Shiryaev claimed that there are no doubts that in reality for 
all T the choice A=0 is the optimal one, but the method of proof en- 

IV 

ables him to establish the result only for a certain bound T. This 

(79) 

conclusion has been discussed later in a paper by Vorob'ev 
Secondly, Shiryaev realized that for the condition A=0 there corre- 
sponds the error probabilities a =0, jS =1, and that from the point of 
view of the testing of statistical hypotheses, it is completely inadmis- 
sible. However, he argued that the probabilities a and jS , by them- 
selves, without taking account of the duration of the separate stages 
of observation, are not reasonable characteristics for the observa- 
tion system, and that a probability of miss alarm j3 close to unity 
in no way discredits the observation system if the duration of the 
separate stages is sufficiently small. 

The above result can be explained as follows: By definition Wald's 
sequential analysis is a single stage decision procedure. The process 
terminates whenever the trajectory exits through the upper or the low- 
er boundary. For the normal system the trajectory will exit through 
the lower boundary with a probability of (1-n), and for the degraded 
system the trajectory will exit through the lower boundary with a prob- 
ability of /3. Since the system is operated a priori m the normal mode, 
then with the condition A(0) to be zero the trajectory will immediately 
exit through the lower boundary A of zero, and by definition it is a 
stage. Since there will be only a finite number of times for trajec- 
tories to exit through the upper boundary, while there are an infinite 
number of times for trajectories to exit through the lower boundary, 
this explains the reason that the false alarm error probability a is 
defined as zero. The miss alarm error probability /3 will be simi- 
larly defined as one. The drawback remarked by Shiryaev has been 
completely removed in the developed suboptimal detection system 
formulation. 
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4. 6 Simulation Results 


To demonstrate the actual numerical performance of the detec- 
tion system described in this chapter, a redundant sensor system 
was simulated. The simulation results consist of three sets. The 
first set is to demonstrate the detection technique of the suboptimal 
system. The second set illustrates the comparison of detection per- 
formance of the optimal, the suboptimal and the linear detection sys- 
tem. The third set shows the isolation of the degraded instrument. 

The first and second set of simulation results are based on a 
single gyro system. The dynamic equations for the gyro drift rate 
model in the discrete case are given m equation (2-4) by 

D G (k+l) = F D Q {k) + q G (k) 

The measurement equations are defmed as 
D (k) = D G (k) + r Q (k) 

D (k) = D G (k) + m + r G (k) 

The statistics of the noises are given as 

E [q Q (k)] =0 , E [q G (k) g G (jO] = Q G* 6 k/ 

E [r G (k) ] = 0 E[r Q (k) r Q ( ^ ) 3 = 


for mode H Q 
for mode 


The measurement noise r G (k) represents the quantization error. If 
Qg is the quantum size of the measurement of integrated rate and the 
increments of angle are calculated by differencing, the variance of 
the error in the increment angle is 


R 


A6 



2 

g 


The variance in the angular velocity over the sampling period A 
is then derived as 
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A set of real gyro test data is used as reference for the gyro 
model in simulation: 

correlation time = 0. 5 hour 

sampling period = 6 minutes 

quantization size = 6 arc seconds 

standard deviation over long measurement 
interval = 2. 5 meru 

A simple computation gives the following set of data: 

/ 

Q g = 2 meru 2 , R Q = 0. 2 meru 2 , F = 0. 8 

The system is normalized so that the following parametric values are used: 

Q g = 1 , R g = 0. 1 , m=4, F = 0.8 , A = 0. 1 

As discussed in Chapter 3, the transformed measurement resi- 
dual r„(k) for the single gyro system in the discrete case for degrad- 
ed mode can be shown to have the normal distribution N(a^, cr) 
with the asymptotic statistics as defined in (3-35) by similar manipu- 
lations : 

a x = (1 + ,F V ) ■ m 
£ 2 = R g /(1-K) 

The values of V x and the steady- state Kalman filter gain K can be ob- 
tained by either analytical or simulated computation: 

. V = -0.98146 , K = 0.91368 
x 

Thus the detection system is designed with the equivalent speci- 
fications to detect a degradation with magnitude a 1 = 0. 86 in the back- 

~ 2 ~ 1 
ground of noise with variance a =1.16. 
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A discussion of the transient value of the transformed measure- 
ment residual r„(k) for degradation detection is required. Suppose 
G 

a degradation of a constant bias m occurred at the (k+l)th sample, 
then the measurement residual r^(k+l) of the implemented system 
at that instant will reflect a bias of full magnitude, i. e. , a^(k+l) =m. 
The magnitudes at the next two samples can be respectively shown 
to be: 


a x (k+2) = ,(1 - F K) m 

and 

a.^ (k+ 3) = (1 - F K (1 - F(l-K) ) m 

and the magnitude will be finally attenuated to its asymptotic value. 
The transient amplitudes of the measurement residuals will give a 
large signal- noise ratio for effective detection, of a degradation. 

The optimal boundaries of the stochastic process A(t) are deter- 
mined by the following specifications: 

(1) For Wald's linear detection system, the false alarm and miss 
alarm error probabilities are specified to be 1%. The upper and low- 
er boundaries B and A are computed as 

B = /n (Al£) = 4. 595 
a 

A = /n (-£-) = -4. 595 
1-Q 

(2) For the suboptimal system, the upper boundary B, for de- 
tecting degradation is computed under the assumption of the same 
specified mean time T between two false alarms as Wald's linear 
system. The boundary B^ can be solved by equations (4-218) and 
(4-221): 

e B -l B 1 

- (B + A - )i - e 1 - B - 1 

l-e A 

B x = 6. 12 
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By (4-224), the mean time T can be computed as* 


T 

A 



(e 1 - B x - 1) 


~ 1200 


(3) For the optimal system, the lower boundary X and the 
upper boundary 7 for detecting degradation are determined by (4-237): 


X = H n(i) 
r 

7' = $n (S ) + / n (r T) 
r 

where the parameter r (4-191) in the discrete-time case is defined as 



2 . A 


The parametric value q of the failure rate is specified to be 0. 001, 
this gives the mean life time T m : 

T = 1 = 1000 
m 

q 

The implemented equation A(t) for the optimal detection system 
is the discrete form of equation (4-55), with some notational changes 
for the single gyro system using an integration algorithm of Euler's 
method: 

A = A 1 + — (r -ia. ) + qA (e nl + l)' 
n ii“ 1 2 xi " J* 

cr 


where A(n) is denoted by A , and r = r(n) is the measurement re- 

J n n 

sidual of the discrete time case discussed in Section 3. 3. 
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The simulated equations A(t) for the suboptimal detection system 
in the discrete form are equations (4-142) and (4-143) implemented 
with reset control logic as discussed in Section 4. 4. 5. 

(1) The recursion equation of the detection system for bias with 
positive polarity is represented by: 


A 


n 


A - + (r 
n-1 n 



and reset A = 0 in next iteration if A <0. 
n n 

(2) The recursion equation of the detection system for bias with 
negative polarity is represented by: 


M = M -. + 
n n-1 


(r. 


n 


+ K> 


and reset M = 0 in next iteration if M >0. It is to be noted that 
n n 

the definition of the likelihood function A^ (and M^) of the suboptimal 
system has been modified by a factor of (o^/a^) [See definitions 
in (4-136) and (4-139)] 

Figures 4. 6 - 4. 9 demonstrate the detection trajectories of the 
suboptimal system. In Figs. 4. 6 - 4. 7, a degradation of bias value 
m = +4 occurs at sample time n = 258. The system detects the degra- 
dation at n = 292 in Fig. 4. 6 with correct identification of a positive 
polarity. This illustrates a case in which the instrument happens to 
send out "good” data even after it became degraded. This is reflected 
by the fact that the system requires more observations to detect the 
degradation. Fig. 4. 7 illustrates the detection trajectory on the basis 
of likelihood ratio function for a degradation of bias value with nega- 
tive polarity. No degradation is indicated, since this trajectory is 
not the correct decision function to apply for a degradation with posi- 
tive polarity. In Figs. 4. 8 - 4. 9 a degradation of bias value m = -4 
occurs at the sample time n = 44. No degradation is indicated m Fig. 
4. 8, since the trajectory is not the correct decision function to apply 
for a degradation of bias value with negative polarity. The system 
detects the degradation at n = 48 in Fig. 4. 9 with correct identification 
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of negative polarity. This illustrates a case where the instrument 
sends out bad data right after degradation. One should not claim a 
better detection performance for the case m Fig. 4. 9, rather the 
difference m sample sizes required for detection reflects the differ- 
ent situation of the actual performance of the real system in two 
cases. The above fact illustrates the advantage of the sequential 
detection system, in which the sample size for detection reflects 
the quality of the actual instrument output data, over the detection 
system based on the predetermined number of observations. For 
the latter case the detection system will lose time in the case illus- 
trated m Fig. 4. 8 by having to wait for a decision at the prespecified 
sample size. In the case illustrated in Fig. 4. 6 the system will 
likely miss the degradation at the test of a prespecified sample size, 
and will wait for the next test to detect the degradation. 

The second set illustrates a comparison of the performance of 
the detection systems. Figures 4. 6, 4.10, and 4. 11 show the com- 
parison of performance of the detection systems, with information 
of the bias polarity given for Wald's linear system and the optimal 
system. A set of identical random numbers is used in the simula- 
tion of the linear, optimal and suboptimal systems. The detection 
performance of the optimal and suboptimal system can be shown to 
be practically the same as illustrated in Fig. 4.10 and 4. 6. A com- 
parison of the detection trajectories of the optimal and suboptimal 
systems m Fig. 4. 10 and 4.6 verifies the'analysis discussed in Sec- 
tion 4. 5. 3. It is to be noted that in the suboptimal system the bound- 
ary value for degradation detection is calculated to be = 6. 12, 
the same as the boundary value (7 - A) used in the optimal system in 
Fig. 4. 10. The threshold value shown in Fig. 4. 6 is modified by a 
factor of (a /a^ ) to be consistent with the definition of the likelihood 
ratio function which was modified for computational convenience. 
The extra time delay in detection by Wald's linear system is clearly 
illustrated m Fig. 4.11. Figs. 4.12 - 4.14 show the performance 
of detecting a degradation with an unknown bias polarity. The deteri- 
oration of detection performance of the Wald system modified by the 
averaging process described in section 4. 4. 5 is clearly shown in 
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Fig. 4. 14, when one compares with that of the suboptimal system as 
illustrated in Fig. 4. 12. 

Figures 4. 15 - 4. 26 show the isolation of the degraded instrument. 

The set of parity equations used in the simulation is given in Table 2. 4: 

• * 

(fi A - fig) • c + (fig + fig) * s = 

- (fi^ + fig) * c + (fig + fig) '3 = 22 

(fi E + fig) • c - (fi A + fig), ♦ s = z 3 

(fi^ ~ fig) " "* ^F^ ® 

(fig + fig) • c - (fig + fig) ’ s = Zg 

■ — ^ _ ^F^ " ® ~ 

where c = cos i> - 0. 851 
s = sin (f> = 0. 526 

For all gyros defined in the normal mode, the dynamic equations 
for each parity equation residual (denoted as z) are defined in (3-32) 
and (3-33): 

x(k+l) = F x(k) + q^(k) 
z(k+l) = x(k+l) + r^ (k+ 1) 

The statistics of the noises are given by 

E[q 1 (k)] = 0, E[q x (k) q^/)] = Q- 6^ = 2 Q q - 

E[r x (k)] = 0, E[r x (k) r^)] = R« = 2R Q • & k£ 


185 



The parametric values are then defined as 

<3=2, R = 0. 2, F = 0. 8, A = 0.1 

Suppose that instrument A has been degraded with a jump in bias 
with value of +10. Figures 4.15 - 4.22 demonstrate the detection of 
degradation by the suboptimal system using parity equation residuals 
Z 1 ” z 4 simultaneous identification of respective degradation 

polarities, while Figs. 4. 23 - 4. 26 correctly demonstrate that the 
parity the parity equation residuals z g and z g are operated in the 
normal mode. In particular Figs. 4.19 - 4.20 illustrate the deterio- 
rated performance for detecting the degradation of the instrument A 
by the parity equation residual z^. In comparison with residuals z^, 

Zg and the residual z^ suffers from an attenuation of the signal- 

noise ratio for detection by a factor The number 

cos <j> 2. 5 

of samples required for detection for the residual z g is 93 compared 
to that of 5 or 6 required for residuals z^, Zg or z^. With the addi- 
tional information of identification of the polarity the degraded instru- 
ment A can be reliably isolated by an efficient decision logic. The 
decision logic can be defined by identifying the parity equation resid- 
uals z^ = +1 (or Zg = -1) and z & = 0 (or z g = 0), and by ignoring the 
parity equation residual z^. Similarly for isolation of the instrument 
B, the decision logic can be defined by identifying the parity equation 
residuals z^ = -1 and z^ - 0 (or z g =0), and by ignoring the three par- 
ity equation residuals Zg, z^, and z g , which magnitudes are all atten- 
uated by the factor sm$. Here the state of the normal mode Hq is 
denoted as "0", and of the degraded mode is denoted as "l" with 
the proper sign identifying the polarity. 
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FIG. 4. 6 DETECTION TRAJECTORY OF SUBOPTI3VLAL SYSTEM (RUN 1) 




LIKELIHOOD RATIO FUNCTION 
(NEGATIVE POLARITY) 



FIG. 4. 7 DETECTION TRAJECTORY OF SUBOPTIMAL SYSTEM (RUN 1) 



LIKELIHOOD RATIO FUNCTION 
(POSITIVE POLARITY) 



FIG. 4. 8 DETECTION TRAJECTORY OF SUBOPTIMAL SYSTEM (RUN 2) 
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FIG. 4.9 DETECTION TRAJECTORY OF SUBOPTIMAL SYSTEM (RUN 2) 


LIKELIHOOD RATIO FUNCTION 




LIKELIHOOD RATIO FUNCTION 



FIG. 4. 11 DETECTION TRAJECTORY OF WALD'S LINEAR SYSTEM (GIVEN BIAS POLARITY) 



LIKELIHOOD RATIO FUNCTION 
(POSITIVE POLARITY) 



FIG. 4.12 DETECTION TRAJECTORY OF SUBOPTIMAL SYSTEM (RUN 3) 



LIKELIHOOD RATIO FUNCTION 



FIG. 4. 14 DETECTION TRAJECTORY OF WALD'S LINEAR SYSTEM (UNKNOWN POLARITY) 



LIKELIHOOD RATIO FUNCTION’ • LIKELIHOOD RATIO FI NOTION 

(NEGATIVE POLARITY)* ^ (POSITIVE POLARITY) 
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FIG. 4. 17 DETECTION TRAJECTORY OF PARITY EQUATION RESIDUAL Z 0 
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FIG. 4. 18 DETECTION TRAJECTORY OF PARITY EQUATION RESIDUAL Z 9 

M 


197 







FIG. 4. 19 DETECTION TRAJECTORY OF PARITY EQUATION RESIDUAL Z 3 



FIG. 4.20 DETECTION TRAJECTORY OF PARITY EQUATION RESIDUAL Z„ 
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FIG. 4. 23 DETECTION TRAJECTORY OF PARITY EQUATION RESIDUAL Z 
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FIG. 4. 24 DETECTION TRAJECTORY OF PARITY EQUATION RESIDUAL Z & 
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FIG. 4.25 DETECTION TRAJECTORY OF PARITY EQUATION RESIDUAL Z 6 



FIG. 4. 26 DETECTION TRAJECTORY OF PARITY EQUATION RESIDUAL Z g 
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CHAPTER 5 


Identification System 


5 1 General Discussion 

The identification system is the decision stage to determine the nature 
of the degradation modes of the operating system Verification of the nor- 
mal mode and classification of degradation modes are the decisions to be 
made in the identification system Verification is necessary because of 
the probable false alarms made in the detection stage or the probable re- 
qualification of a "degraded" instrument Classification identifies the infor- 
mation about the nature of degradation in the form of a constant bias or ramp 
bias change in the drift rate 

The data processing of the identification system is implemented as fol- 
lows After a "degraded" instrument has been isolated, the identification 
system will process a specific parity equation residuals from the parity 
equation set C (Table 2-4). The residual is chosen to givethe highest sensi- 
tivity of the degradation bias signal with respect to the isolated "degraded" 
instrument The basic technique employed for the identification system 
formulation is Wald's sequential probability ratio test procedure An invar- 
iant transformation is developed to transform the ambiguous three-class 
identification problem into a set of pair wise disjoint two -class identification 
problems . 

5 2 The Problem Formulation 

The identification system is characterized by the following features 
First, the "degraded" instrument is operating and remains operating in 
one of the specified modes, no transition of modes is assumed to occur 
In the identification stage of a relatively short duration with respect to 
the mean life time of instruments this will be a reasonable assumption 
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Second, no a priori information about the instrument operating condition 
can be assumed The system must consider all alternate modes and the 
two-sided decision process must be used Identification is a single stage 
decision process The process terminates whenever acceptance of either 
class is decided Third, the reliability of the decision is of main concern 
in the identification system The false alarm and miss alarm error prob- 
abilities are chosen as the performance criteria Since in the identification 
system both boundaries of alternate modes must be specified, therefore 
both error probabilities are well-defined 

The above analysis suggests that Wald's sequential probability ratio 
test (SPRT) is a natural formulation for the identification system The 
original discrete -time case of the SPRT is well-known (Wald^ 0 ^), the 
continuous -time version of the SPRT will be discussed In Chapter 3 the 
stochastic differential equation X(t) for the case of an unobservable process 
in the form of an unknown yet unchanged constant was derived, 

( a i -a n) i 

dX(t) = g (dy ” *2~ (5-1) 

G 

where 


Mt) - tn [j^yi , X(0) = 0 


The observable process is defined as 

dy = adt + adu (t) 

where a = a^ for the degradation mode, or a = a Q for the normal mode. 

A study of the definition of the posterior probability ?r(t) will show 
the fact that the quantity Mt) is the same as the logarithm of the likeli- 
hood ratio function defined in Wald's SPRT process The initial condi- 
tion A(0) with a value of zero reflects a state of no a priori information 
of the system being in either normal or degradation mode 

The performance criteria a and /3 will now be derived by application 
of the theory of first passage problem. Let A and B be the lower and 
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upper decision boundaries for identification of the normal and degraded 
modes Let ofx) and /3{x) denote the error probabilities with respect to 
the process A(t) given A(0) = x, and A < x < B Then by definition the false 
alarm error probability ofx) denotes the probability of the trajectories of the 
process Aft) which is defined in the normal mode and which first exits through 
the upper boundary B Similarly the error probability 3(x) denotes the prob- 
ability of the trajectories of the process Aft) which is defined in the degrada- 
tion mode and which first exits through the lower boundary A 

For evaluation of the error probabilities ofx) and /3(x) defined for the 
specified initial condition x, the backward equation is the appropriate enua- 
tion to apply .Let p^(x;y, t) be the transition probability of the homogeneous 
diffusion process 


dX = n(X) dt + cr( A) dw(t) (5-2) 

then p^(x;y, t) satisfies the backward equation: 

If (5 - 3 > 

Let $(x) be the ultimate probability of absorption of trajectories Aft) in 
one of the boundaries A or B, conditioned on A(0) = x. By Laplace transfor- 
mation of (5-3) and manipulations as in Section 4 5__1, it can be shown that- 

- — *"■ * ’ “ (24) 

$(x) is the solution to the equation : 

2 

-|-<t 2 (x) p(x) = 0 (5-4) 

dx 

The appropriate boundary conditions must be used For example, if </>g(x) 
is the probability that absorption m the boundary B occurs before A, then 
the appropriate boundary conditions are <£g(B) = 1, and $g(A) = 0 

The above technique will be applied to evaluate ofx) and j3(x), since in 
sequential procedures the error probability by definition represents the ul- 
timate probability of absorption of the trajectory Aft) in the boundary The 
stochastic process A^(t) for evaluation of the error probability ofx) is defined 
in the normal mode Hq, and is represented by 
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(5-5) 


2 

1 a l a l 4 

dXg(t) = - ~2 — 2"dt + dw(t) = Hq dt + b dw(t) 

<j a J 


Then the error probability a(x) is the solution to the equation 


1 ,2 d> . dtp n 

T b -t + “o¥ =0 

dx 


(5-6) 


with the boundary conditions «KA) = 0, and i|j(B) = 1 The boundary condi- 
tions imply that the absorption in the upper boundary B occurs before 
the lower boundary A. Substituting the definitions of b and jig into {5-6) 
produces 


d> 

1? 


dx 


(5-7) 


then or{x) can be solved to be 


a{x) 



(5-8) 


Similarly the error probability |3(x) is the solution to the differential equa- 
tion 


£k+ o 

dx dx 


(5-9) 


with the boundary conditions: ^(B) = 0, »Jj(A) = 1. The solution ]3{x) is 
shown to be 


J3(x) = 


A, B-x 
e (e 


- 1 ) 



(5-10) 


It is interesting to compare these results with those in Wald's original 
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formulation. By definition. 


A /n , 1-e 

a - a (0) - 


a B A 
e -e 


f> =f(o) = sYr 

e-e 


(5-11) 


Solving the simultaneous equations one obtains Wald's well-known 
formula connecting the boundaries A, B with the error probabilities a 
and 0 


A = ,n (i a-) 


B = 

a 


(5-12) 


These results have been mentioned m (4-58) of Section 4. 4. 1. In 
the discrete-time case, Wald has managed to derive the same results 
by the approximation of neglecting the excess of the likelihood function 
over the boundaries In the continuous-time case the formula gives the 
exact solution and the derivation is much simpler 

5 3 An Invariant Transformation 

It was shown in Chapter 3 that the following approach is used in the 
design of the detection system 

(1) The unknown bias parameters of the degradation modes are not 
estimated, instead the detection process is constructed on the basis of 
a "design" value, specified by the mission performance requirement 

(2) The degradation mode in the form of a ramp change of the mean 
drift rate is not distinguished from that in the form of a constant bias 
jump Instead only one detection system is designed for the degradation 
mode in the form of the constant bias jump. 

The approach is justified by the application of a comparison theorem 
for diffusion processes. However the above approach cannot be applied 
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in the identification system In the identification system the basic infor- 
mation used for identification of different degradation modes is the like- 
lihood ratio function of the joint conditional probability density defined as 


A 

n 



y | H ,b.) 

i i 


y 1 H ,b ) 

n 3 J 


(5-13) 


where is the ith- degradation mode, H the associated unknown param- 
eter, and (y n ) is the sequence of the observation data The determina- 
tion of the joint likelihood ratio function A n in the identification process 
requires not only the specification of a given functional form for the con- 
ditional probability density of the degradation modes but also the unknown 
values of the associated parameters Thus the parameters must be esti- 
mated for calculating A n Since the observation data come from unknown 
classes of degradation modes, the parameter b must be estimated by 
weighing over all possible classes: 


P (b | y x , • • • y n ) 

_ p(y n |b,y 1 "-y n _ r H i )-P(H i ) + p(y n ib,y 1 , y n _^> H^) • P (H^) 
p(y n ly x > •”y n _ 1 ^H i )> P(H.) + p{y n ty r -•.y n _ 1 ,H ; .)- P <H ) 


p(b ly r -y n _ 


( 5 - 14 ) 


where b represents the unknown parameters in both class, b^ and b. Since 

samples are unclassified, (5-14) cannot be carried out seperately for each 

(34) 

class in general, and the computation is very difficult (Ho et al ) 

To circumvent the difficult computation procedure, an invariant transfor- 
mation is developed to eliminate the effect of unknown parameters on the 
identification process 


The transformation can be stated as follows Let {y^} be a sequence 
of identically distributed (l d ) and independent random variables of normal 
distribution N(a, ct), where a is an unknown mean and cr is given Let {zL} 
be a sequence defined by the transformation 





1 


• • • 


n-1 


(5-15) 


<* + 


^ n-1 /n(n-l) 1 


^ V^7 y » 


Then {z. } is an equivalent sequence of i. d and independent variables 

J 

having the normal distribution N(0, cr) with the same variance and zero 
mean 

Moreover the following relation holds: 

n-1 
2 

J =1 

where 

The proof of the results will be presented in Appendix C. 

The transformation reduces the given data {y} to the transformed 
data {z} , and replaces in effect the original family of joint distribution 
P n (y n J a-) with the new family of distribution P n {z n ,A) Here X = X(a) de- 
notes the parameter corresponding to the original unknown parameter a 
induced by the transformation The transformation leaves the identifica- 
tion problem invariant in the sense that the decision procedure at any 
stage about the original process with an ^unknown parameter can-be trans- 
formed into an invariant decision procedure based on the transformed 
process 

This transformation is useful only for the identification stage, where no 
transition of states exists, i e the observations are random variables 
of identical distribution For the detection stage, where a transition of 
states is possible, the mean cannot be eliminated by this transformation 

The identity (5-16) gives the intuition that the mean is eliminated by 
utilization of the best available information of the updated sample mean. 
For finite storage the transformation must be implemented in the follow- 
ing form 


~2 n __ o 

z - = .5, ty-V 

l-l 


J 


n 


y n = 


n 


2 ?i 

i=l 1 


(5-16) 


'n-1 


v/”n(n-l) 


n-1 

s y 

i=l 1 


n-1 


yj n(n-l) y n 
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In this case only the cumulated sum of past data need be stored 

In Chapter 3 it was shown that if the measurement process z(t) is 
defined to be in the degraded mode with an unknown mean bias, the 
transformed measurement residuals (3-39) will be asymptotically of 
i d. and independent random variables of normal distribution with a 
corresponding unknown mean bias and a given variance In the case of 
the ramp bias the transformed measurement residual (3-40) will be 
asymptotically of independent random variables of normal distribution 
with a corresponding unknown mean of approximate ramp bias and a 
given variance An application of the invariant transformation to the 
measurement residuals will then transform the original process with 
unknown constant or ramp bias into the corresponding process with 
zero or an attenuated ramp bias. 

Original Class Transformed Class 

H x - N(a,cr) N(0, a) 

H 2 : N(bk, cr) N(~Jfck, ct) 

where a and b are unknown constant parameters, k is the sample index 

The attenuated ramp rate -^-bk can be derived as follows Let the 
original process {y} be defined in the degradation mode with a mean ramp 
rate b Then from (5-15): — 


E [ z , ] 
1 n-1 J 


— i— - . [b ( S i)- (n-1) • n . b] 

y/n (n-1) i= 1 

(5-17) 

- — y/ n (n-1) ■ b 

2 

- i (n-1) • b ( if n is large) 

2 


It is noted that a positive ramp bias in the original process { y } is trans- 
formed into half the ramp bias with a negative polarity The presence of 
a ramp rate in the transformed process will give an effective information 
for identification of the degradation mode Hg, even if the parameter of the 
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ramp rate in the original process is of an insignificant magnitude 

To eliminate the effect of the constant bias in the original process 
{y} a simple transformation defined in (5-18) can be designed 

Vi ■ y n - y n -i ; n * 2 < 5 - 18 > 

In this case a ramp bias with rate b of one sample interval in the original 
process is transformed into a constant bias with magnitude b In com- 
parison, the proposed transformation (5-15) will give more effective in- 
formation for identification after a sample size of three observations of 
the original process Since the parameter b of the ramp rate in the prac- 
tical system is usually of an insignificant magnitude, the identification 

capability of a small magnitude b by the transformation (5-18) in a com- 

2 

paratively strong background noise level a is questionable 

It has thus been shown that the ambiguous multi-clas's (i. e , three 
classes) identification process can now be transformed into a set of pair 
wise disjoint two-class identification processes The verification of the 
normal mode is done by processing the measurement residuals of the 
original process The classification of degradation modes into constant 
bias or ramp bias mode is done by processing the transformed measure- 
ment residuals. Both decisions are reduced to a simple testing procedure 
against the presence of a constant mean (with known polarity) A mean 
is considered to be present if its magnitude exceeds a specified value 
B = 5 • a where a is the given standard deviation and 6 is a positive spec- 
ified parameter. 

In Chapter 4 it was shown that the suboptimal detection system can 
simultaneously identify the bias polarity in detecting the degradation. It 
is important to assign the correct polarity to the specified value of the 
decision threshold B for the transformed process It is noted from (5-25) 
that a positive (negative) ramp bias rate in the original process will be 
transformed into a negative (positive) ramp bias rate A specification of 
incorrect polarity will cause the sequential probability ratio test process 
to make a wrong identification decision This is illustrated in Fig 5 1 
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Fig. 5.1. Effectof Polarity on Identification System. 


In this figure, the sample x will be correctly classified into the de- 
gradation mode if the right polarity is assigned for the specified value B of 
the decision threshold for identification. However the sample x will be in- 
correctly classified into the normal mode if the wrong polarity is assigned. 

The block diagram of the identification system is given in Fig. 5.2. 

5.4 Simulation Results 

Figures 5. 3-5. 6 demonstrate the classification of the degradation mode 
into the class of constant bias or ramp bias by application of the technique of 
the invariant transformation {5-18). The classification is done by process- 
ing the transformed measurement residuals. The discussion is based on 
Wald's sequential testing process against the presence of a specified con- 
stant mean. In the simulation results the degradation mode of a constant 
bias of +4 against a ramp bias of rate value of -0. 11 is to be identified. 

The threshold value of the constant mean is specified to be with an 
appropriate polarity. In simulation the threshold value 6 is specified to be 
a value of 0. 5 and specified with a positive polarity due to the negative ramp 
bias. The parameter cr of the single gyro system was defined m section 4. 6. 
Figure 5. 5 illustrates the identification of the degradation mode of the ramp 
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Fig. 5. 2 i Block Diagram of Identification System 









bias. The likelihood ratio function shows that during the early stage of 
identification the decision moves toward the class of the constant bias due 
to the relatively small parameteric value of the ramp rate. As the number 
of samples increases, the weighting factor of the current data in the trans- 
formed residuals become more significant to indicate the presence of the 
ramp bias. The likelihood function reflects the information and identifies 
correctly the class of the ramp bias The identification of the class of 
the constant bias is illustrated in Fig 5. 3 Figure 5 4 and Figure 5 6 
demonstrate the trajectories of the transformed measurement residuals. 

It can be seen, though not very clearly, that the trajectory in Fig 5 4 
shows a zero mean, and the trajectory in Fig 5 6 shows the trend of a 
positive ramp rate This illustrates the justification of the invariant trans- 
formation discussed in section 5. 3. 
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CHAPTER 6 


Compensation System 


6 1 Problem Statement 

After the degradation mode has been identified, a compensation sys- 
tem is designed to recover the degraded instrument performance, if pos- 
sible. Since an accurate knowledge of the model of the degraded system 
is now available, the design of the compensator is simply reduced to that 
of a statistical estimator to estimate the unknown bias of the identified 
mode (H^ or H 2 ) 

The problem of estimating a constant but unknown bias in recursive 
filtering has been extensively studied In this chapter, a technique of de- 
coupled estimator is applied to estimate the unknown bias The conver- 
gence rate of the bias estimation will be studied Finally, some discussion 
of the simulation results will be presented 

6 2 A Decoupled Bias Estimator 
( 29) 

B Friedland has developed a technique of estimating an unknown 
bias by a decoupled estimator. In this technique the bias estimate can be 
computed in terms of residuals in the bias -free estimate, and thus effec- 
tively is decoupled from estimation of the state A block diagram of the 
estimator is shown in Figure 6,1. 

The motivation of this technique is to reduce the dimension of the esti- 
mate of the state variable in the filtering process, thus avoiding the compu- 
tational difficulties, speed as well as numerical inaccuracies, associated with 
large matrices This will be the case when the number of bias terms is 
comparable to the number of state variables of the original problem The 
new state vector to be estimated will be substantially increased in dimen- 
sion by the augmented additional bias terms However, the motivation to 
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Fig. 6. 1. Block Diagram of Bias Estimator 


use this technique for the compensation system design m this thesis re- 
sults from a different consideration. During most of the mission period 
the system will be operated in the normal mode In this case a recursive 
filter is designed in the bias -free condition to generate the input information 
for the detection and identification systems. Once an instrument perfor- 
mance degradation has been detected and identified, the filter should be modi- 
fied to estimate the associated bias value This must be done with as simple 
modification of the original filter as possible so that the detection and iden- , 
tification processes can proceed as before without any complicated reorgani- 
zation. This motivates the use of a decoupled bias estimator 

(1) Computation of Decoupled Bias Estimate 

The results of the decoupled estimator are based on expressing the so- 
lution of the variance equation of the problem with bias present in terms of 
the solution of the variance equation for bias -free estimation and other 
matrices which depend only on the bias -free computations The following 
presentation is essentially a simplified restatement of Friedland's results 

The notations used in the derivation are consistent with those of Chap- 
ters 3 and 4 
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Let the dynamic equations be expressed as follows' 
x(t) = -«x(t) + g(t) 
m = 0 


( 6 - 1 ) 

( 6 - 2 ) 


and the observation equation is expressed as 

z(t) = x(t) + c(t) • m + v(t) (6-3) 

The following notations are used 
x = original state (scalar) 
m - bias term (scalar) 
z = observation (scalar) 

g = process noise, with given statistics of zero mean and 
variance E[g(t)g(r )] = W • 5(t-T) 

v = observation noise, with given statistics of zero mean 
and variance E[v(t)v(T )] = V * 6(t-r) 

The factor c(t) determines how the bias m enters into the observations 
For the designed system, c = 1 for the degraded mode and c = t for the 
mode H 2 

Define a new augmented state vector 

x", ' 
y =[ m I 

The dynamic and observation equations can be written as 


y = A y + G g 
z = L y + v 

where 

A= [o’ o]' Q= [o]* L= U°] 

Application of the Kalman-Bucy filtering theory results in the following 
equations for optimum estimate y of the state y 


(6-4) 

(6-5) 
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( 6 - 6 ) 


y = A y + P L T V 1 (z - L y) 

where the a posteriori covariance matrix P is the solution to the co- 
variance equation. 


P = AP + P A T - P L T V' 1 LP + GW G T 


(6-7) 


The covariance matrix P is partitioned as follows 


P = 


P P 
x xm 

P P 
xm m 


( 6 - 8 ) 


The variance equations then take the form of the following scalar 
equations 


(P + c P ) 

P =-z u P - *2L_ + w 

X X y 


(6-9) 


(P+c*P ) (P+c-P ) • c 

P = t - u 2 SJL j P _ SSI • P (6-10) 

xm 1 „ J xm m 


V 


V 


P = - 
m 


(P + c P )‘ 
xm m 

V 


( 6 - 11 ) 


It is noted that equations (6-10) and (6-11) together are homogeneous in P 


and P . Hence, if 
m 


x;m 


then for all t "> 0, 


P (0) = 0 
xm 


P (0) = 0 
m 


P (t) = 0 
xm 


p m (t) =0 


( 6 - 12 ) 
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(6-13) 


and in this case P x satisfies 


2w P 



The interpretation of the equation (6-12) and (6-13) is that if the bias m 
is perfectly known (P (0) - 0) at t = 0, then by virtue of (6-2), it is perfect 
ly known thereafter and the estimation problem reduces to that in which 
there is no bias The variance equation (6-13) is the same as would result 
if m were known to be zero (with probability 1) . Let the covariance matrix 
in this case be denoted by 



where P is the solution to Eq. (6-7) for the initial condition. 


(6-14) 


P(0) = 


iyo) 

0 


0 

0 


(6-15) 


and P is the solution to Eq. (6-13) with P (0) given. 

X X 


If the bias is not perfectly known, then (6-15) is not the correct initial 
condition for the problem to be solved Let 


P(0) = 


P (0) 
x 


P 


xm 



P (0) 
xm 


P (0) 
m 


where P (0) must be non -zero 
m 

It is noted that m (6-9) - (6-11) the equations for P , P , and P 

n x xm m 

are all coupled and hence must be solved together A transformation will 
now be derived to decouple the solution of the variance equations The 
following result is used that if P is a solution to (6-7) then any other solu- 
tion can be expressed as follows 


P = P + V M V T 


(6-16) 
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where 


V = (A - P h T V 1 L)V (6-17) 

M- -M V T L T V" 1 L V M (6-18) 

where Mis a scalar in the designed system case and V can be partitioned 
as 



Then Eq. (6-17) and (6-18) can be expressed as three scalar equations 


V = ( -w - -#-) V 
x V x 


V = 0 i. e. V - constant 
m m 


(V +V • c) 2 M 2 

and M = (6-21) 

* 

The variance equations have been decoupled It is possible to solve 

for P independent of P and P > then solve V V ,and then M 
x ^ xm m x m 

The actual values of the desired variances are evaluated as 

P = P + V 2 • M 

XXX 


P = V 
xm x 


M V 


m 


( 6 - 22 ) 



The initial conditions are not unique In the important special case in 
which P xm (0) = 0, i e , there is not a priori correlation between the state 
and the bias, a convenient choice of initial conditions is- 
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M(0) = P (0) # 0 
m 


V (0) = 0 
x 


(6-23) 


In this case 


V m = 1 


P = V M 
xm x 


P = M 
m 


(6-24) 


The results of this analysis can now be applied to obtain a form of the 
decoupled bias estimate from the equation (6-6) 


(P + c P ) 

A _ A , X xm / A A > 

x = -w x + (z - x - c m) 


(P + P 


V 

c) 


a . •: xm _,__ m — :: (z _ a _ c A > 


m 


(6-25) 

(6-26) 


The equations (6-25) and (6-26) can be shown to be expressed in the de- 
coupled form by the transformation: 


A ~ A 
x = x + V x m 


(6-27) 


Then the final result for the bias estimate can be expressed as 


• M(v + c r (V Y + c)M 
m T? m + - ' — (z - x) 


V 


V 


(6-28) 


where x is the estimate of x which would be obtained in the absence of bias, 
i e x satisfies 


x = -w x + (z - x) 


(6-29) 


The asymptotic solutions of (6-27) and (6-28) are of interest It can 
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easily be shown by substituting {6-24), and the equations (6-50) and 
(6-54), (6-51) and (6-59) in the next subsection into (6-28) that the asymp- 
totic solution of m can be represented as 

in = m (6-30) 

and therefore the asymptotic solution of x will be 

x = x + V x m (6-31) 

These results have been utilized in the discussion of the transformation 
of the measurement processes (3-12) and (3-25) in Section 3 3 

A schematic representation of the computation of estimates in the 
presence of bias is shown in Fig 6 2. 

The corresponding results for the discrete -time case can be summa- 
rized as follows 

The dynamic equations are defined as 


x(k) = F x(k-l) + q^ (k-1) 
, m(k) = m(k-l) 

The observation equation is 


(6-32) 


z(k) = x(k) + c(k) m(k) + r^k) (6-33) 

where c(k) = 1 in degraded mode H^, and c(k) = k in mode Hg. 

The process noise and observation noise are given with statistics of 
zero mean and variances- E[q^(k) q^(J?)] = Q 5^ and E[ r^(k) * r^($)] = R 6^ 
respectively. 

The derivation for the discrete -time case is more complicated, but 
is based on a transformation of the discrete variance equation as in the form 
of equation (6-16) for the continuous -time case 

P'(k) = P'(k) + U(k) M(k) U T (k) 


where P'(k) is the a priori covariance matrix of the estimate of the aug- 
mented state at time k conditioned on the measurement history 
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{ z(l) . . . z(k-l)} . P'(k) is any solution to the covariance equation of 

P'(k) with P’(0) =£ P'(0). U(k) and M(k) are auxiliary matrices The 
derivation -will then be similarly carried out by partitioning the augment- 
ed matrix P'(k) as in the continuous -time case 

Let P^.(k) be the a priori variance of the estimate of x(k) conditioned 
on the measurement history { z(l) . z(k-l)}, P (k) be the a posteriori 

variance of estimate x(k) conditioned on { z(l) . . z(k)} . The a priori 

variance equation and the posterior variance equation are transformed 
into the following forms as in the continuous -time case, 

P ' (k) = P ' (k) + uj(k) M(k) 

A X X 

P im (k) = U x (k) M(k) (6-34) 

P^(k) = M(k) 

and 

P (k) = P (k) + V?(k) M(k + 1) 

X X X 

P vm (k) = V (k) M(k + 1) (6-35) 

Xlll X 

P m (k > = M(k + 1) 

where 



’u (k)“ 


’ V (k) 

U(k) = 

X 

U (k) 
m 

L. J 

V(k) = 

X 

V (k) 
m 


and M(k) is a scalar quantity computed as 

M(k + 1) - M(k) - p_^(k) 1 H + S^(k) M(k) (6-36) 

The auxiliary factors U (k), V (k) and S(k) are recursively computed 

X X 

as follows* 


V (k) = U (k) - K (k) S(k) 

XXX 


(6-37) 
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where 


S(k) = U (k) + c(k) 

A 

U (k+1) = F V (k) 

X a 

The following initial conditions are used- 

U (0) = 0 , P (0) = 0 , U (0) = 1 
x xm m 

The variance F 1 , P and the bias -free gain K for the bias -free 

XX X 

estimator are computed in the normal manner 

K (k) = P^(k) / (P^(k) + R) (6-38) 

P (k) = (l - K (k)V Pj(k) (6-39) 

P' (k+1) = F2 P (k) + Q (6-40) 

X X 

The corresponding equations for estimates x and m can be written in a 
form similar to the continuous -time case 

x(k) = x(k) + V (k) m(k) (6-41) 

X 

m(k) = (1 - K (k)S(k)) m(k-l) + K (k) ?(k)- (6-42) 

m m 

where 

x(k) = F x(k-l) + K (k) [ z(k) - F x(k-l)] 

X 

and 

r(k) = z(k) - F x(k-l) 

are the bias -free estimate and the residual of bias -free estimation respec- 
tively. The gain K m (k) of the bias -estimator is given by 


K (k) 
m 


M(k+1) [V (k) + c(k)] 
R 


(6-45) 
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(2) Convergence Rate of Bias Estimate: 


For the on-line adaptive system the convergence rate of the bias 
estimate determines the rate of compensation for performance degradation, 
and will thus have a significant influence on over -all system reliability 
and performance In Chapter 2 a preliminary discussion of convergence 
rate of the bias estimate was presented. In this section the condition for 
the optimal convergence rate will be analyzed and the optimal rate of the 
bias estimate will be evaluated 

Since the bias estimate will converge to its true value, as the mean 
square error of the estimate converges to zero, the rate of convergence 
for an estimator is often expressed in terms of mean square error of the 
estimate In the structure of the linear estimator the problem of obtain- 
ing the optimal convergence rate is reduced to design of an optimal gain 
function such that the mean square error at each and every time is mini- 
mized. By definition this is the optimal gam function derived in the 
minimum variance Kalman estimator. 

Thus the optimal convergence rate can be evaluated by the variance 
equations derived for the decoupled estimator The variance P (k) for 
the discrete -time case must be computed by a set of recursive formulas 
It is difficult to get a closed form of the convergence rate of P (k) as a 
function of the time index k in the dis crete -time case. A simpler develop- 
ment and a more readily interpretable result can be obtained for the con- 
tinuous-time case. The variance equation for P (t) in the continuous time 

m 

case is represented in (6-24), (6-21) and (6-23), 

P m (t) = M(t) (6-24) 

M 2 (t) • (V (t) + c(t)) 2 

M(t) (6-46) 


with the initial condition P-^O) = M(0). Thus the problem is reduced to 
solution of an ordinary differential equation (6-46) Substituting (6-24) 
into (6-46) produces 


P (t) = - 
m 


P m (t) (V x (t) + C(t))2 
_ 


P m (0) = P 0 


(6-47) 
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where c(t) = 1 for mode H^, and c(t) = t for mode Bb,. 

The quantity V (t) satisfies the equation, 

X 

p (t) p (t) 

V (t) = - (u + ) V (t) — c(t), V (0) = 0 

x v x v x 


(6-48) 


This is an ordinary differential equation with a time -variant coefficient 
If only the steady-state Kalman filter is implemented, then (6-48) can be 
simply solved Define the quantity 

= the steady state Kalman filter error variance 


0 


oo > 0 


V 


(6-49) 


7 = (w + ~^r) > 0 


The quantity V (t) can be determined with the result 

X 


V (t) = -JL(1 - e _Tt ) 


x 


7 


(6-50) 


for degradation mode H,, and 


V (t) = — |-( 7 t - 1 + e"^ t ) 
X 7 


(6-51) 


for degradation mode 
Define 


W m <t) = PVt5 


m 


(6-52) 


Substituting the solution V (t) (6-50) and the definition W (t) into 


X 

(6-47) produces the expression of W m (t) for degradation mode 


0 \2 


W (t) = W (0) + 
m m 


(l--tL)“ 2 L ( 1 - 4 *) 
* 

— • t + — ■ 

7 • V 


1 • t+-Z X- (l-e'D 


V 


+ 


2 V t' 


(1 - e’ 2 ^) 


(6-53) 
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The convergence rate P m (t) for mode Hj for sufficiently large t can 
be expressed as 


~ 1 


P m<« = k 


1 + k 2* 


(6-54) 


where the quantities and k 2 are defined 


k l = 


2[L) 

1 y y j3 

+ + rr3 


(6-55) 


P m<0> 


7 • V 2 V 7 ' 


(l-i .) 2 
1 , . v 

K. 

2 V 


(6-56) 


The expression of W m (t) for degradation mode can be similarly derived 
by use of the solution V (t) in (6-51) and the definition (6-52). 

X 


w m (t) = W m ( 0 )+ 4 ‘J’_ {U“|-)o- +A (l-e " TC7 )} 2 da 


m 


V 


y 


The result is somewhat tedious, therefore only the solution W. 
large t will be considered. 


m 


(6-57) 
(t) for 


W m (t) " W m (0) + T ‘ + (1 + ^T } 4- 9 • t 2 + 

y ' y ^ y^ 


1 0 - 1> 2 

w_(0)+i — t— 


m' 


(6-58) 


V 


-L , + i.. k - +3 
P m (0) 3 K 2 


where k^ is defined in (6-56). 

The convergence rate P (t) for mode H for sufficiently large t can 

m * 
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then be expressed as 


1 


p m (t) a 


1 . 3 

p (0) 32 X 

m 


(6-59) 


The following conclusions can be derived from (6-54) and (6-59): 


(A) The convergence rate P (t) is also determined by the quantity 
(denoted as -V in (3-21) of Section 3 3)* 


y (P^/Y + co ) 


(6-60) 


which is dependent on the correlation parameter w of the dynamic system. 
For the highly correlated dynamic system with a small parametric value 
io, the quantity 


JL 


- i 


y 


(6-61) 


and 


(1 - l)«l 

7 


in this case a slow convergence rate will be expected 

(B) The convergence rate P (t) for degradation mode H will be much 
faster than the convergence rate for degradation mode H^. 

These conclusions are confirmed by the simulation results 
6 3 Simulation Results 

Figures 6 3 - 6 4 show the estimates of the bias by the technique of 
the decoupled estimator Figure 6 3 illustrates the estimate of a constant 
bias of a value +4. As discussed in section 6 2, the convergence rate is 
shown to be extremely slow For an estimate within 95% of the true value 
a size of more than 800 samples are required. For comparison the typical 
sample size required for detection is about 30 samples as illustrated in 
simulated results in section 4. 6 The saving in time required for detection 
compared to that for estimation will thus be of the order of 25. Figure 6 4 
illustrates the estimate of a ramp bias with a rate value of +0. 1. The con- 
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vergence rate in the ramp bias case is much faster as is expected from 
the analytical expression (6-59). 
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CHAPTER 7 


Conclusions and Recommendations 


7 1 Conclusions and Contributions 

An on-line adaptive technique is developed to provide a self-contained 
redundant-sensor navigation system with a capability to utilize its full 
potentiality in reliability and performance The gyro drift rate is modeled 
as a Gauss -Markov stochastic process The degradation modes are defined 
by changes in characteristics specified by parameters associated with the 
model. The dominant modes resulting in navigation performance degradation 
are shown to be of the form of a jump of a constant and a ramp bias of the 
gyro drift rate 

The adaptive system is formulated as a multi-stage stochastic decision 
process a detection system to detect the existence of a degradation and to 
isolate the degraded sensor, an identification system to verify -a degradation 
and to determine its nature, and a compensation system to recover the de- 
graded sensor performance The inputs to be processed by the adaptive sys- 
tem are the aggregated scalar states of residuals generated by a set of parity 
equations of outputs of redundant sensors 

The detection system is characterized as a process with partial infor- 
mation, m which the state describing the degradation is inaccessible for 
direct observation The problem of solving the partially observable process 
is approached by deriving the stochastic differential equation for the posterior 
probability of the unobservable state conditioned on the a priori information 
and the observable measurement history It is shown that for the optimal 
detection system formulated as a Bayesian problem with additive risk and 
conditionally independent observations the posterior probability is the suffi- 
cient statistic. 
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A detection system, in a class of the linear system of Wald's sequen- 
tial probability ratio test is developed. The feature of the system design 
is utilization of feedback of the uncertainty information indicated by the 
posterior probability The design of the detection system is formulated as 
a combined stochastic control and decision problem The admissible control 
region is defined with reference to the value of the information as an indica- 
tor of system degradation. A suboptimal control scheme is designed to 
avoid the complicated computation for obtaining the solution to the com- 
bined optimization problem formulation The suboptimal control law is 
derived by utilization of the fact about monotinicity property of the risk 
function of detection The suboptimal control for the discrete -time case is 
shown to have the characteristic that the control corrects the posterior 
probability to a constant threshold number, if it is less than the threshold. 

If the posterior probability is larger than the threshold, no feedback is 
used. The difficulty in extending the results of the suboptimal control law 
in the discrete case to the continuous-time case is discussed, and a con- 
trol law based on the concept of e-optimal control is suggested. 

The performance of the detection system is derived by application of 
the theory of first passage times for diffusion processes It is shown for 
a sequential system with one-sided boundary for degradation detection the 
meaningful performance criteria are the mean delay time in detection and 
the false alarm error probability. 

The developed system is very simple for on-line implementation The 
system shows a remarkably close detection performance to that of the op- 
timal non-linear detection system It is shown that the developed system 
can be simply modified to detect the degradation with simultaneous identi- 
fication of its unknown polarity, which is shown to be an important piece of 
information for efficient isolation of the degraded gyro 

It is shown that only one detection system designed for the degradation 
mode in the form of a jump of a constant bias is sufficient for all degrada- 
tion modes characterized by a systematic mean change Moreover, the 
detection process can be constructed on the basis of a "design" value, 
specified by the mission requirement, of the unknown parameter in the 
real system. 
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In the design of the identification system an invariant transformation is 
derived to eliminate the effect of nuisance parameters. An application of 
the technique to the measurement residuals will transform the original pro- 
cess with unknown constant and ramp rate into a corresponding process with 
a zero mean and with an attenuated ramp rate The transformation gives a 
sensitive identification capability even for a degradation with a small ramp 
rate It is shown that the ambiguous three-class identification process can 
be transformed into a set of pairwise disjoint two -class identification pro- 
cess 

A technique of decoupled estimator is applied in the compensation 
system to estimate the unknown bias such that the adaptive system can be 
operated without any complicated reorganization. The long time interval 
required for processing information in the estimation process in comparison 
with that required in the decision process illustrates the justification of the 
multi-stage formulation of the thesis 

7 2 Recommendations 

In this section suggestions for further study will be briefly discussed 

The treatment of the degradation mode having an increase in variance 
of the gyro drift rate will be considered The detection of a change in vari- 
ance can be treated by a simple extension to the theory of sequential analysis 
with correlated measurements to test the hypothesis Hq of the normal vari- 
ance against the hypothesis of a specified "degraded" variance. The test 
procedure consists of observing the likelihood function of the joint conditional 
probability density against two thresholds related to the set error bounds. 

The likelihood function can be recursively generated on-line by the two se- 
quences of measurement residuals derived from two Kalman fitlers modeled 
on hypothesis H n and H, respectively This has been discussed by Newbold 

/j- -i- 

and Ho ' However, the problem of the adaptive system design m this 
case must be handled with some modifications 

First the sequence of measurement residuals is an asymptotically 
stationary but not an independent sequence if the Kalman filter model is not 
based on the true variance One simple way to ensure independence is to 
sample the data at a sufficiently long interval. Since it was discussed in 
Chapter 2 that the increase in variance during degradation of the instrument 
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is gradual and small, and one notices that the sequence in this case will be 
only weakly correlated, then the effect of a moderate increase in the sam- 
pling interval will not likely cause a serious degradation of the navigation 
system performance Secondly, since the variance increase during degra- 
dation is gradual, there is no justification to assume that a degradation mode 
of a change in mean bias will not occur during the same interval. This com- 
plicates the design of the identification system If independence of the 
measurement residuals can be ensured, the invariant transformation devel- 
oped in this thesis can be applied to remove the constant bias mean In 
this case by processing the transformed measurement residuals, the degra- 
dation modes of variance increase and of a constant bias jump can be 
classified. However, there will be an ambiguity in classification between 
the degradation modes of the variance increase and the ramp bias, and 
some other technique or information for their classification is required 
Finally, the problems of the design of optimal and suboptimal systems for 
the on-line estimation of the "degraded" noise variance, for example, by 

( 52 ) 

using analysis of measurement residuals, have been discussed by Mehra 

(1) 

and Abramson It is noticed that the performance of a degraded instru- 
ment due to an increase in variance cannot be requalified, but the updated 
variance may give the information for optimal weighing of the measurement 
data among different instruments in the navigation system, or give an in- 
dication that the instrument will have an impending hard failure 

The technique developed in this thesis can be extended to compensate 
the drift rate non-stationary component modeled as a random walk process 
The random walk component of the gyro drift rate is proposed to be treated 
as an unknown mean of arbitrary form. No modification is required in the 
detection system to augment this component into the gyro drift rate How- 
ever, in compensation for performance degradation, one would have to be 
content to match the unknown complicated function to only one of the two 
given classes, a constant or a ramp mean bias. The compensation will be 
based on a model with either the constant or ramp bias degradation mode 
according to the classification result by the identification system. 

An important area is the study of problems involved in choosing use- 
ful information in adaptive system designs. One direction of research is 
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to introduce into the information theory the element of value (Brillouin 
Howard Stratonovich In the present theory information is treated 

as an absolute quantity which has the same numerical value for any user. 

On the other hand, the value of the information would necessarily be a 
relative quantity, and would have different values for different users. 

This intuition leads one to seek a notion of the value of information 

The design of the detection system in this thesis illustrates an ap- 
proach by relating the concept of the information value to the theory of 
optimal solutions This is equivalent to selecting the information accord- 
ing to a certain figure of merit The question of interest is then to derive 
the most effective information with respect to the cost functional. One 
may conceive of a situation where more information may lower the effec- 
tiveness of choosing the optimal solution The design of the detection 
system illustrates a case in which not all information is useful, and it 
is possible to carry out further minimization of the risk function of 
detection by choice of only useful relevant information 

The above formulation introduces an application of the concept of the 
value of the information In many adaptive systems, the question of 
interest is how measurements should be made so that one can extract most 
effectively all the relevant information which will be utilized in designing 
the adaptive system This problem remains an open question 
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APPENDIX A 


* 

DERIVATION OF WALD’S RISK FUNCTION 


Let 6 (ui) = { 7 - ( u; ) , d ( cu) } be some decision function from the 
results of observations on the process (y(s), sir}. Here t(<o ) is 
the random stopping time for terminating the observation, d(w ) is the 
terminal decision rule. The terminal decision space D° consists of two 
points d Q and d^. The decision d^ will be interpreted as the decision that 
the hypotheses is accepted. Let o(6) = P{d(a>) = d^lH^} and 
/3 {S) = P{ d ( to ) = dg I H^} be the false and miss alarm error probabilities 
at the terminal decision for the decision function 5(u )• Then the risk 
function associated with 6(u ) is defined as 

R U , tt) = *■ {c r ) + £./? U ) }+ (I-*) {c E Q ( t > + b.a (*) } 

(A-l) 

Notice that R(6, ir) denotes the total expected cost starting from the 
initial state t to the terminal decision based on the decision function 6. 

For the case of specified a and 0, a and b are the Lagrange multipliers. 
The coefficient c is the sampling cost per unit time. 

Let.Tr, - P {a = a 1 )y(s) , sit} be the posterior probability that 
is true based on observations |y(s) , si tj . Define the operator* 

E (•) - 7T ■ E ( • ) ,+ (1-tt) • E n < -) 

< 7T 1 U 

Then (A-l) can be written as* 

tt) = n\ c E 1 (r) + a • £(£)}■ + (1-n) | c E Q (r) + b • a (£)} 

(A-l) 
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R (6, it) = c E^ (t) + 7r a • P { d (to) - | } 

+ (1 - 7r)b • P { d (u) = d^ | Hq} 

= c E^ (t) + -it a f P {d (u) = d Q | y (s) , sIt} dP 1 (u) 

+ (1 - 7 r)b * J P { d (w) = d^ j y (s) , s = t} dP Q (u) ( A ~‘ 

Now for any function S(v ,t ) of states t , = tt (t ) at the final 

decision 


7 r . 

E { L_ . S( 7 T r) }= E { S , r) } (A-3) 

*r x (r) r J 

where 7r (r ) is the posterior probability that H. is true. The relation 
(A-3) can be verified as follows 

E { ~Ii_ • S(7 r ,'r)} 

1 r t <r) 

77 

= S f S (it T , t) — ♦ dp.(w) (A- 4) 

r Q 

T 1 - 

where is the set of observations which lead to decision of termination at the 
r 

instant r P^(u ) denotes the conditional joint distribution of observations 
y . . . ,,y generated by the random process { y(s), s = r} Writing 
explicitly P^u) gives 

P x (w) = ^ i (y 1 ) .... /^(y ) 

As discussed under equation (4-36), the "generalized" notation {y } 

A T 

is used, while y denotes the observation Ay^. = (y T - y r ^) in the real 
system. It can be seen that the observations are independent random vari- 
ables and moreover of identical distribution denoted as * ) The latter 
comes from the statement of the Wald's problem that the state a is an un- 
known, yet constant value, and there is no transition of states in the obser- 
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vation process. In this case the posterior probability jt^t ) can be ex 
pressed as 


* d n (y. ) .... d u (y_ ) 

i t (r) = __J: LJ ... 1 r i = 0, or 1 

k ^o 'k d "k ( yi ) •••• d " k <y r > 

Then one can obtain: 

7 r 1 

• dP.(ui) = S v . d// k (y )... 'd// k (y ) 

7T (r) 1 k=0 k K 1 r 


(A-5) 


The relation (A-3) can be proved by substitution of (A-5) into (A-4). 

On using the relation (A-3) the following relation can be established 

J tt P { d (u) = d Q j y(s), s g t} • dPj^ (to) 

= / tt' (u) • P { d (w) -d Q | y (s) , s^t} dP^u) 




by identifjdng S(* ) = JT(t) * P{d(<u) = dQjyfs), s < and E. = E^. 
Similarly 


/ (1 - 7r) P { d (to) = d 1 j y (s) , s ^ t} dP Q (u) 

= f (1 - n (u)) P { d (u) = d 0 [ y(s) , sifJdP^y 
Q 

by identifying S( • ) = (1 - n (r)) • P{ d(w) = d | y(s) , s < r} and E. = E Q . 
Then (A-2) can be rewritten as 

E(6,tt) = cE ff (r) + J [ a tt^ (u) • P{d(u) = d Q | y(s), s g r} 

+ b - (1 - ar T (u))p{d(u) = d 1 | y(s), s S r^dP^ (u) 
g cE^trJt J mm [a - ir^ (to) , b (1 - (w))] dP^ (w) 
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Define 


Then 


K (tt ) = Mm [ a 7r , b (1-97-)] 

R(tf , it ) [cr + K{jr r )] 


(A- 6) 
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APPENDIX B 


DERIVATION OF STATIONARY PROBABILITY 
DENSITY FUNCTION 


It has been discussed in section 4. 5. 2 that at the instant of degrada- 
tion a stationary condition, established in the observation process in the 
normal mode, can be assumed to exist The stationary probability density 
function p(y) generated under this condition is to be derived in this appendix. 
The density p(y) is solved by the Kolmogorov forward equation The forward 
equation for p-^(y, t) can be shown to be 

\ (b 2 p(y,t)} - in(0) • p(y,t)} = (B-l) 

1 ay ay at 

where the observation process X(t) defined in the normal mode is represented 

ty 


dA(t) = /u(0)dt + b dw(t) 


(B-2) 


with parameters 


ju(0) = 



b = 



<j 


The equation (B-l) is subjected to the homogeneous boundary conditions at 
the absorbing boundaries A and B, 

P X (A, t) = p x (B, t) = 0 t>0 (B-3) 
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and to the initial condition A{0) = 0, 


°) = 5(y) (B-4) 

The partial differential equation (B-l) can be solved by application of 
the Laplace transformation 

CO 

P(y, s) = / exp(-st) • p(y,t) dt (B-5) 

0 _ 

Then (B-l) is transformed into 

2 2 — 

1 a i / 9 P 8P \ 

T^2 ay^ + ~^/ = s P “ P&’ °) < B “ 6) 

Substituting (B-4) into (B-6) yields 

a 2 2_ 

t-t'- 1 -# + l£-> ‘ s P= -w ( B - 7 > 

a 3y J 

subject to the corresponding boundary conditions in (B-3). 

The equation (B-7) can be solved by the method of Green's function 
g (y, 0,s). The function g (y, 0,s) satisfies 

2 2 — 

i a i ^ P _ 

i_|_( _+_lP_) _ s p = 0 (B- 8) 

a dy dy 

everywhere except at y=0. The equation (B-8) can be solved as an ordi- 
nary differential equation with s considered as constant Since the station- 
ary condition of the probability density is the only piece of information de- 
sired, the final value theorem will be applied to save the effort of taking the 
inverse transformation The final value theorem is stated as 


lim p(y, t) - lim s p (y, s) (B-9) 

t-»oc S-* 0 
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Multiplying (B-8) by the constant s, and using the relation (B-9) the 
equation (B-8) can be simplified to 

-^-§-+-^2_=0 (B-10) 

d y dy 

where p = p(y) = p(y, co) 

The Green function g(y, 0) satisfies the equation (B-10) everywhere ex- 
cept at y=0, and is subjected to the conditions in (B-3). Let g~(y) and 
g (y) be the solutions to the equation (B-10) in the region A g y < 0 and 
0 < y S B respectively with boundary conditions 


+. 


g (A) = g (B) = 0 


(B-ll) 


— -p 

then p (y) and p (y) of the Green function g(y, 0) take the form 


P (y) = c • g (0) • g (y) 
P + (y) = C • g"(0); g + (y) 


A g y < 0 
B g y > 0 


(B- 12) 


where the associated functions g (y) and g (y) can be easily solved to a 
constant factor as 


g (y) = (e A y - 


1 ) 


g + (y) = (1 - e B y ) 


(B-l 3) 


Substituting (B— 13) into (B-l 2) and normalizing p(y) to a density function 
produces 


p(y) = 


(l-e B ) (e A ~ y -l) 
A(l-e B ) + B(e A -l) 


A g y g 0 


(e A -l) (l-e B ~ y ) 
A(l-e B ) + B(e A -l) 


0 < y g B 


(B-14) 
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APPENDIX C 

PROOF OF PROPERTIES OF INVARIANT 
TRANSFORMATION 


Let { z^} be a sequence defined by 


me li ■a.nsiox-in.cu.iuii 


Vi 5 j— 1 — ( y i +-- + y n -j) - ■ 7 = n = . = y n > < n = 2 ) 
n X yn(n-l) 1 n 1 yn(n-l) n 


(C-l) 


where { y^} is a sequence of independent and identically distributed (i. d ) 
random variables of normal distribution with an unknown mean a and a 
given variance cr . Then { z-} is a sequence of i d. and independent ran- 

J 

dom variables having the normal distribution with zero mean and the same 
variance Moreover the following relation holds 


n ' 1 ~2 

n 

, - >2 - 1 

n 

2 z? = 

j-i J 

2 

i=l 

(y. - y ) ; y = — 
T J n ^n n 

2 y. 

i=l 1 


(C-2) 


This result can be proved as follows Since the {z^} are the result 
of a linear transformation of y^s and {y^} are of normal distribution, 
so { z } are of normal distribution. The statistics of { z } can be de- 

r .1 

rived as follows (for i = n^l): . , 


{E< 


n-1 

2 

i=l 


y.) - (n-l)E(y n )} = 0 


(C-3) 


E t Vi ] = l^TT E y n y 


1 

n(n-l) 


E 


n-1 

{( 2 y 

1=1 3 


) 2 - 


n-1 

2(n-l)y ( 2 y ) 
n 1=1 1 

(equation continued) 
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+ (n-1) 2 y 2 } 


= { (n-l)E (y 2 ) + (n-l)(n-2) • [E(y> ] 


-2(n-l) 2 • [E(y)] 2 + (n-1) 2 E(y 2 ) } 
= E(y 2 ) - [E(y)] 2 = ct 2 


(C-4) 


Here the i. d. and independent properties of { y^} distribution have 
been used in the derivation. Hence it has been shown that { z.} are i. d 
random variables of normal distribution N(0,cr). To prove the indepen- 
dence of random variables of normal distribution, it is sufficient to show 
that the random variables are uncorrelated: 

E[z k ^i] = E t^] = 0 k*j 


E[z z fe ] = E [ 


J 


: 1 s yf - jy,- + i \ ‘ 

v%+T) i=i J+1 yk{kuT 


(C-5) 


* ; , y i' ky k+i n 


1=1 


ii j 

. x . e[ s (y- - y . i ) • 

i/k(k+l) i=l 1 1 


{s , yi- k yk+i } ] 

i=l 


r 1 ' ; s E {(y i - y +1 ) [ 2 y £ - k y k+1 ] } ] 

/j(j+i) JuST). 1=1 3 -^=1 - 


It is easy to show that (k> j) 


e {(y^yj+p f s y, - k y k +P( 

= e {( yi - y J+1 )y 2 + (yi - y J+ i>y J+ i 1 = 0 
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by using the fact that the random variables { y } are i d and independent. 


E(yj - Y-j) = 0 

E(y x y ) = E(y x )E(y ) i * D 

E [yf - y*] = o 

The symmetry of E[z^ z^.] with respect to its indices j and k implies the 
same result for case k<j. This concludes the proof of the first part. 

To prove the identity 


where 


n-1 9 n 

2 z = 2 (y - y ) 
=1 J i-l 1 n 


- *2 


1 11 

y = — 2 y 
n n i=l 


(C-6) 


Consider the equivalent identity 


n - 2 

?. < y i - y n > ' 
1-1 


n-1 


- \ 2 


<y ! - Vl' 


1 n_l 

- [—= ( 2 y - (n-l)y )] 
V n( n- 1 ) i=l 1 n 

This can be proved by some manipulation 


n 


n-1 


s (y i ~ yJ ■ s (y i - y n _i) 
1=1 1 n 1=1 1 ni 


n-1 


= i=! {(y i' y n )Z ' <y i‘ y n-l )2} + (y n - y 0 > 


(C-7) 


= (n-l) (y n -y n _ 1 ) 2 + (y n -y n ) 2 
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Sut <y n - J n ) - (n-1) (y a - y^j) 


Thus the relation (C-7) can be shown* 

^ o n-1 o 

2 (y i " V “ 2 (y i ' Vi > 
i=l 1 n i=l 1 n L 


/ “ \2 

(y n ‘y n ) - ? 

11 , n + (y -y r 

n-1 VJ n 


= <y„ - y n >“ 


= { 


n 


1 r £ , , 2 

"7= [ s y, - n y n ] > 

/n^T i=l 1 n 


or 


1 n_1 2 

[ [ s yr<«-i)y n ] ] 


i/n( n- 1 ) i=l 


n 


n-1 


- \2 _ ~2 
z - 
n-1 


(C-8) 


^ - y n > - = ^ - y n -i> 

1-1 i=l 

By repeated applications of the equation (C-8) and by a summation of both 
sides of the formed equations the following identity can be proved. 

n o n— 1 c 

2 (y i - v = 2 s 

i=l 1 11 3=1 3 


(C-9) 


This concludes the proof of the statements about the transformation 
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